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Preface 



Atomic clusters are aggregates of atoms or molecules with a well-defined size 
varying from a few constituents to several tens of thousands. Clusters are 
distinguished from bulk matter in so far as their properties are strongly affected 
by the existence of a surface involving a large fraction of the constituents and by 
the discreteness of their electronic excitations. On the one hand, the finite nature 
of the number of constiments leads to novel structural and thermodynamic 
properties with no equivalent in the bulk. On the other hand, clusters bridge the 
physics of atoms to the physics of bulk matter. Studying thoroughly die physical 
properties of clusters may shed new light on elementary excitations in solids and 
liquids. 

The physics and chemistry of atomic clusters and nanoparticles constitute a 
broad interdisciplinary domain. The field is growing fest, as witnessed by the 
number of publications and conferences or workshops. The directions of growth 
are numerous. Some involve pure basic science, others are motivated by more 
applied material science considerations. Within the fast development of new 
technologies one observes an unavoidable trend toward miniaturization in micro- 
and nano-electronics. One wishes to technologically master devices of sizes so 
small that their quantum specific properties are the most important. Specific 
magnetic properties of clusters or dots on surfaces might offer new materials for 
applications in high-density recording and memory devices. Similar trends are 
observed in other fields, such as catalysis, energy storage, and control of air 
pollution. 

The present school was focused on basic science. It gathered lecturers with 
research practice in atomic and molecular physics, condensed matter physics, 
nuclear physics, and chemistry and physical chemistry, not to mention 
computational physics. 

The field has developed rapidly thanks to a strong coupling between theory 
and experiment. Two predominantly experimental courses provided the 
opportunity to appreciate what is and will be experimentally possible. Patrick 
Martin presented a large experimental review, and Hellmut Haberland lectured 
on recent experimental observations of phase transitions in metal clusters. 

The theoretical courses covered three main domains: (i) electronic properties 
of metallic clusters and nanostructures, (ii) phases and phase changes of small 
systems, and (iii) chemical processes in nanoscale systems. George Bertsch gave 
a series of quite interactive lectures that introduced the students to basic 
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phenomena in metal clusters, quantum dots, fiillerenes and nanotubes. He 
presented the basic theoretical tools to study cluster properties, particularly 
electronic excitations. He emphasized the importance of simple models that 
large-scale ab initio calculations validate, as well as the universal features of 
finite quantum systems. 

The Density Functional Theory stands at a central position as a quantum 
mechanical method for practical studies of large molecules and clusters. Today, 
the combination of quantum mechanics and molecular mechanics using classical 
force fields allows us to understand biological systems to a large extent. Denis 
Salahub gave the state of the art of this theory, which he illustrated with 
chemical applications throughout his course. 

Metal clusters and nanosystems are excellent physical objects for 
illuminating the links between the quantum and classical worlds. Matthias Brack 
reviewed semiclassical methods of determining both average trends and quantum 
shell effects in the properties of finite fermionic systems. His course was 
centered on two important theoretical themes: (a) the Extended Thomas-Fermi 
Model, and (b) the Periodic Orbit Theory (POT). 

Pairing correlations in atomic nuclei are intimately related to the phenomena 
of superconductivity (and superfluidity) in macroscopic systems. Recent 
experiments on small metallic clusters also reveal pairing correlations. Hubert 
Flocard devoted his course to a thorough investigation of pairing correlations in 
finite systems within the state of the art theoretical models. 

Cluster and nanoparticle physics is also part of condensed matter physics. 
Matti Manninen focused his course on the physics of nanosystems from the point 
of view of condensed matter physics, emphasizing concepts and mesoscopic 
features that are common between finite systems and low-dimensional systems. 

Magnetic properties of small systems are very sensitive to the size, structure, 
and composition of the system. Within a pedagogical four-lecture course 
Gustavo Pastor provided a deUiiled account of the most powerful theoretical 
methods for efficiently describing the magnetic properties of clusters, 
particularly transition-metal clusters. The spin-fluctuation theory is quite 
appropriate for clusters since changes or fluctuations of structure are important. 
Scattering processes are also useful to gain insight on the many-body properties 
of finite systems, as demonstrated by Andrey Solov’yov in his two-lecture 
course. He emphasized the essential role of surface and volume plasmon modes 
in the formation of electron energy loss spectra. 

The thermodynamical properties of clusters are certainly of major 
importance. They require theoretical approaches to the concepts of melting, 
fi'eezing and phase changes in finite systems and their dependence on size. The 
notions of “phase-like” forms, coexistence, solid-liquid equilibrium, phase 
diagrams and their specific formulation in terms of the thermodynamical 
variables and functions in various ensembles require the development of new and 
sophisticated algorithms. 
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Computer simulations of cluster dynamics and thermodynamics have become 
a major tool for predicting and understanding the finite temperature behavior of 
clusters. David Wales' course was concerned with concepts and recent methods 
of achieving topological analysis and sampling of complex multi-dimensional 
potential energy landscapes. 

Sergei Chekmarev described a novel approach to the computer simulation 
study of a finite many-body system that allows one to gain detailed information 
about this system, including its potential energy surface, equilibrium properties 
and kinetics. Biomolecules can now be investigated with a high degree of 
confidence. 

The chemical processes in or with gas phase clusters and nanoscale particles 
are a growing field of cluster science. The key to understanding the properties of 
various families of atomic clusters lies in the determination and description of 
the chemical bond. Concepts such as valence and valence change, bond 
directionality, hybridization, hypervalence, electronic population distribution and 
charge fluctuation are essential. The comse of Pavel Hobza addressed the above 
concepts, methods and mechanisms. Lucjan Piela focused his lecture (not 
published here) on cooperativity effects in quantum chemistry. This subject is 
indeed topical for molecular clusters. 

In ad^tion to the main courses that are the contents of this book, there were 
seminars not published herein. These seminars, that triggered stimulating 
discussions, were given by Jacqueline Belloni, Stephen Berry, Catherine 
Brdchignac, Vlasta Bonacic-Koutecky, Frank Hekking, Joshua Jortner, Vitaly 
Kresin, Richard Lavery, Eric Suraud, and Ludger Woeste. 

During the four weeks the lecturers and students got to know each other 
pretty well. Hopefully every student discovered and shared the excitement that 
his (her) colleagues from other fields experienced. More practically, one would 
have realized that some as yet unfamiliar methods could be of great interest for 
one’s own research. The overall organization of the school provided the best 
conditions to meet these goals. The daily lecture program was kept light, with no 
more than three main courses and sometimes an extra seminar. This schedule 
allowed plenty of time for discussions and organizing small working groups. We 
even observed that some collaborative research work had started. On behalf of 
all the lecturers and students we would like to dedicate this summer school to the 
late Professor Walter D. Knight. Walter Knight and his group at Berkeley 
pioneered the experimental field of cluster beams. His experiments in the 1980s 
led to major discoveries relating to finite size effects and to a strong revival of 
cluster physics. 

This School would not have been possible without; 

the help and the support of the Scientific Board of the “Ecole de 
Physique”; 

the staff of the “Ecole de Physique”: Ghyslaine d'Henry, Isabel Lelidvre 
and Brigitte Rousset; 




the financial support of the Scientific and Environmental Affairs 
Division of NATO, the Research Directorate of the European 
Commission, and the Formation Permanente of CNRS; and 
the support to the “Ecole de Physique” by the University Joseph Fourier, 
the French Ministry of Research, CNRS and CEA. 



C. Guet 
P. Hobza 
F. Spiegelman 
F. David 
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EXPERIMENTAL ASPECTS OF METAL CLUSTERS 



T.P. Martin 



1 Introduction 

It is not obvious that metal clusters should behave like atomic nuclei - 
but they do. Of course the energy and distance scales are quite differ- 
ent. But aside from this, the properties of these two forms of condensed 
matter are amazingly similar. The shell model developed by nuclear physi- 
cists describes very nicely the electronic properties of alkali metal clusters. 
The giant dipole resonances in the excitation spectra of nuclei have their 
analogue in the plasmon resonances of metal clusters. Finally, the droplet 
model describing the fission of unstable nuclei can be successively applied 
to the fragmentation of highly charged metal clusters. The similarity be- 
tween clusters and nuclei is not accidental. Both systems consist of fermions 
moving, nearly freely, in a confined space. 

Many years ago it was noticed that atomic nuclei containing either 8, 
20, 50, 82 or 126 protons have very long lifetimes. It was a challenge for the 
nuclear physicists back in the forties to explain these so-called magic num- 
bers. Since physicists tend to see most objects as perfectly round, it should 
come as no surprise that they assumed atomic nuclei are spherically sym- 
metric. Under this assumption they had to solve only a radial Schrodinger 
equation. 



dr^ 



+ 1 ) 



V{r) 



Pni{r) 






( 1 . 1 ) 



where ^ is the angular momentum quantum number and V (r) is the radial 
dependence of the potential in which the nucleons move. They assumed 
further that the potential could be described by a simple potential well. 
Some confusion can arise because nuclear physicists and atomic physicists 
use slightly different definitions for the principal quantum number n. 



n (atomic) = n (nuclear) -|- (1.2) 

© EDP Sciences, Springer- Verlag 2001 




4 



Atomic Clusters and Nanoparticles 




Fig. 1. The degeneracy of states of the infinitely deep spherical well on a mo- 
mentum scale. The total number of fermions needed to fill all states up to and 
including a given subshell is indicated above each bar. 



Throughout this lecture we will use the principal quantum number from 
nuclear physics, z. e., n denotes the number of extrema in the raidal wave- 
function. 

Eigenstates of the radial Schrodinger equation are often called subshells. 
The subshells of the infinite spherical potential well are shown ordered ac- 
cording to momentum in Figure 1. The lowest energy state is Is then comes 
Ip, Id, If, 2p..., etc. This is, with 2, 8, 18, 20, 34, 40, 58, 90... nucleons, 
subshells are completely filled and the corresponding nuclei could be ex- 
pected to be exceptionally stable. However, these are not the observed 
magic numbers. 

In 1949 Goeppert-Mayer [1] and Haxel et al. [2] came up with a modified 
model which yielded the observed magic numbers. Their idea was that the 
spin-orbit interaction is unusually strong for nucleons. Subshells with high 
angular momentum split and the states rearrange themselves into different 
groups. As we shall see the original shell model, which the nuclear physicist 
had to discard, describes very nicely the electronic states of metal clusters 
[3-18]. 

2 Subshells, shells and supershells 

If it can be assumed that the electrons in metal clusters move in a spherically 
symmetric potential, the problem is greatly simplified. Subshells for large 
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values of angular momentum can contain hundreds of electrons having the 
same energy. The highest possible degeneracy assuming cubic symmetry is 
only 6. So under spherical symmetry the multitude of electronic states con- 
denses down into a few degenerate subshells. Each subshell is characterized 
by a pair of quantum numbers n and £. Under certain circumstances the 
subshells themselves condense into a smaller number of highly degenerate 
shells . The reason for the formation of shells out of subshells requires more 
explanation. 

The concept of shells can be associated with a characteristic length. 
Every time the radius of a growing cluster increases by one unit of this 
characteristic length, a new shell is said to be added. The characteristic 
length for shells of atoms is approxiamtely equal to the interatomic distance. 
The characteristic length for shells of electrons is related to the wavelength 
of an electron in the highest occupied energy level (Fermi energy). For the 
alkali metals these lengths differ by a factor of about 2. This concept is 
useful only because the characteristic lengths are, to a first approximation, 
independent of cluster size. 

The concept of shells can also be described in a different manner. An 
expansion of N , the total number of electrons, in terms of the shell index K 
will always have a leading term proportional to . One power of K arises 
because we must sum over all shells up to K in order to obtain the total 
number of particles. One power of K arises because the number of subshells 
in a shell increases approximately linearly with shell index. Finally, the third 
power of K arises because the number of particles in the largest subshell 
also increases with shell index. Expressing this slightly more quantitatively, 
the total number of particles needed to fill all shells, fc, up to and including 
K is 



K L(k) 

Nk = '^'^2{2e=l) r-. (2.1) 

i=0 

where L{k) is the highest angular momentum subshell in shell k. 

Shell structure is not necessarily an approximate and infrequent bunch- 
ing of states as in the example of the spherical potential well. Figure 1. 
Clearly, almost none of the subshells occur exactly at the same energy for 
this potential. Shell structure can be the result of exactly overlapping states. 
Such degeneracies signal the presence of a symmetry higher than spherical 
symmetry. Subshells of hydrogen for which n + i have the same value, 
have exactly the same energy. This additional degeneracy in the states of 
hydrogen is a result of the form of its potential, 1/r, which bestows on 
hydrogen 0(4) symmetry. Subshells of the spherical harmonic oscillator for 
which 2n + £ have the same value also have exactly the same energy due to 
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n 




RhME/h 



Fig. 2. The states of the infinitely deep sphereical well for very large values of 1 . 
Notice the periodic bunching of states into shells. This periodic pattern is referred 
to as supershell structure. 



the form of the potential, and the resulting symmetry, SU(3). For this 
reason it is said that these systems, hydrogen and oscillator, have quantum 
numbers n + i and 2n + £ that determine the energy. We have shown that 
in + £ is an approximate energy quantum number for alkali metal clus- 
ters [16]. As the cluster increases in size, electron motion quantized in this 
way would finally be described as a closed triangular trajectory [19]. 



The grouping of large subshells into shells is illustrated in Figure 2 for 
the spherical potential well. Here, it can again be seen that in certain energy 
or momentum regions the subshells bunch together. However, the states are 
so densely packed in this figure that the effect is perceived as an alternat- 
ing light-dark pattern. That is, for the infinite potential well, bunching of 
states occurs periodically on the momentum scale. The periodic appearance 
of shell structure is referred to as supershell structure [20,21]. Although su- 
pershell structure was predicted by nuclear physicists more than 15 years 
ago, it has never been observed in nuclei. The reason for this is very sim- 
ple. The first supershell beat or interference occurs for a system containing 
800 fermions. There exist, of course, no nuclei containing so many protons 
and neutrons. It is possible, however, to produce metal clusters containing 
such large numbers of electrons. 
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Fig. 3. Apparatus for the production, photoionization and time-of-flight mass 
analysis of metal clusters. 



3 The experiment 

The technique we have used to study shell structure in metal clusters is pho- 
toionization time-of-flight (TOF) mass spectrometry, Figure 3. The mass 
spectrometer has a mass range of 600 000 amu and a mass resolution of 
up to 20 000. The cluster source is a low-pressure, rare gas, condensation 
cell. Sodium vapor was quenched in cold He gas having a pressure of about 
1 mbar. Clusters condensed out of the quenched vapor were transported by 
the gas stream through a nozzle and through two chambers of intermediate 
pressure into a high vacuum chamber. The size distribution of the clus- 
ters could be controlled by varying the oven-to-nozzle distance, the He gas 
pressure, and the oven temperature. The clusters were photoionized with a 
laser pulse. 

Since phase space in the ion optics is anisotropically occupied at the 
moment of ionization, a quadrupole pair is used to focus the ions onto the 
detector. All ions in a volume of 1 mm^ that have less than 500 eV kinetic 
energy at the moment of ionization are focused onto the detector [22]. 

The reflector consists of two segments with highly homogeneous electric 
fields, separated by wire meshes. The first segment, which is twice com- 
pletely traversed by the ions, is called the retarding held, and the other 
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segment is called reflecting fleld. This two-stage reflector allows a second- 
order time focusing of ions [23]. Two channel plates in series are used to 
detect the ions. The secondary electrons are collected on a metal plate and 
conducted to the electronics. The following main design features of the 
instrument are necessary to achieve such a resolution [24]: 

1. the ions are accelerated at right angles to the neutral cluster beam. 
If clusters are ionized by a laser pulse from the gas phase, there will 
always be a distribution of initial potential energies. The reflector is 
used to compensate for these. If the neutral beam is parallel to the 
acceleration direction, there is also an initial distribution of kinetic 
energies or velocity components parallel to the acceleration diretion. 
If the reflector is used to compensate for the initial potential energy, 
it cannot also compensate for the kinetic energies; 

2. a long (29 cm) retarding fleld segment is used in the reflector. In the 
vicinity of the wire meshes at the end of the two reflector segments the 
electric fleld is not perfectly homogeneous. This causes a slight de- 
flection of ions passing through them and thus a small time error. By 
using along retarding fleld segment, the fleld in he vicinity of the wire 
meshes is lowered, and the deflection of ions passing through them is 
reduced. 



The mass spectra which will be displayed in this paper cover a large range of 
masses. For this reason it will not be possible to distinguish the individual 
mass peaks. For example, at the top of Figure 4 we have reproduced a mass 
spectrum of Cs-0 clusters which appears to be nothing more than a black 
smudge. How do we know how many oxygen atoms th clusters contain? 
This can be seen by graphically expanding the scale by a factor of 100, 
Figure 4. Because of the high resolution of our mass spectrometer, we are 
quite certain about the composition of the clusters examined. 

4 Observation of electronic shell structure 

Knight et al. [3] first reported electronic shell structure in sodium clusters 
in 1984. Electronic shell structure can be demonstrated experimentally in 
several ways: as an abrupt decrease in the ionization energy with increasing 
cluster size, as an abrupt increase or an abrupt decrease in the intensity of 
peaks in mass spectra. The first type of experiment can be easily under- 
stood. Electrons in newly opened shells are less tightly bound, i.e., have 
lower ionization energies. However, considerable experimental effort is re- 
quired to measure the ionization energy of even a single cluster. A complete 
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Fig. 4. Mass spectrum of Cs-0 clusters. Notice that the exact composition can be 
determined on an expanded mass scale. 

photoionization spectrum must be obtained and very often an appropriate 
source of tunable light is simply not available. It is much easier to observe 
shell closings in photoionization, TOF mass spectra. However, depending 
upon the intensity and wavelength of the ionizing laser pulse, the new shell 
is announced by either an increase or a decrease in mass peak height. 

For high laser intensities, multiple-photon processes cause the mass spec- 
tra to be less wavelength sensitive and also cause considerable fragmentation 
of large clusters. The resulting mass spectrum reflects the stability of clus- 
ter ion fragments. Clusters with newly opened shells are less stable and are 
weakly represented in the mass spectra. Notice in Figure 5 that as each new 
shell is opened there is a sharp step downward in the mass spectrum. Re- 
member that cluster ions containing 9, 21, 41, 59, ... sodium atoms contain 
the magic number (8, 20, 40, 58, ...) of electrons. 

For low laser fluence and wavelengths near the ionization threshold the 
mass spectra have a completely different character. As each new shell is 
opened there is a sharp step upward in the mass spectra. Figure 6 (top). 
Open shell clusters have low ionization thresholds which fall below the en- 
ergy of the incident photons, while closed shell clusters remain unionized. 
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Fig. 5. Mass spectrum of (Na)+ clusters ionized with high-intensity, 2.53 eV light. 
The clnsters are fragmented by the ionizing laser. Fragments having closed-shell 
electronic configurations are particularly stable. 

Finally, for low laser fluence and wavelengths well above the ionization 
threshold, it is possible to observe the neutral distribution of cluster sizes. 
If the source conditions are appropriately chosen, this distribution can peak 
at sizes corresponding to closed electronic shells, Figure 6 (bottom). 

Cluster intensities can sometimes be increased by a factor of ten by us- 
ing a seed to nucleate the cluster growth. For example, by adding less than 
0.02% SO 2 to the He cooling gas, CS 2 SO 2 molecules form which appar- 
ently promote further cluster growth. Mass spectra of Cs„+2(S02) clusters 
obtained [15] using four different dye-laser photon energies are shown in 
Figure 7. Although it is not possible to distinguish the individual mass 
peaks in this condensed plot, it is evident that the spectra are characterized 
by steps. For example, a sharp increase in the mass-peak intensity occurs 
between n = 92 and 93. This can be more clearly seen if the mass scale is 
expanded by a factor of 50 (Fig. 8). 

Notice also that the step occurs at the same value of n for clusters con- 
taining both one and two SO 2 molecules. In addition to the steps for n = 
58 and 92 in Figure 7, there are broad minima in the 2.53 eV spectrum at 
about 140 and 200 Cs masses. These broad features become sharp steps if 
the ionizing photon energy is decreased to 2.43 eV. By successively decreas- 
ing the photon energy, steps can be observed for the magic numbers n = 58, 
92, 138, 198 ± 2, 263 ± 5, 341 ± 5, 443 ± 5, and 557 ± 5 [15,17]. However, 
the steps become less well defined with increasing mass. We have studied 
the mass spectra of not only Cs„+ 2 (S 02 ) but also Cs„+ 4 (S 02 ) 2 , Cs„+ 20 , 
and Cs„+ 402 . They all show step-like features for the same values of n. 
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Fig. 6. Mass spectra of (Na)„ clusters obtained using ionizing light near the 
ionization threshold (top) {hu = 3.1 eV) and well above the ionization threshold 
(bottom) {hv— 4.0 eV). In both cases the neutral cluster beam was heated with 
2.54 eV and 2.41 eV laser light. 



First, we would like to offer a qualitatiave explanation for these results 
and then support this explanation with detailed calculation. Each cesium 
atom contributes one delocalized electron which can move freely within the 
cluster. Each oxygen atom, and each SO 2 molecule, bonds with two of these 
electrons. Therefore, a cluster with composition Cs„+2(S02), for example, 
can be said to have n delocalized electrons. The potential in which the 
electrons move is nearly spherically symmetric, so that the states are char- 
acterized by a well-defined angular momentum. Therefore, the delocalized 
electrons occupy subshells of constant angular momentum which in turn 
condense into shells. When one of these shells is fully populated with elec- 
trons, the ionization energy is high and the clusters will not appear in mass 
spectra obtained using sufficiently low ionizing photon energy. 
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Fig. 7. Mass spectra of Cs„+2(S02) clusters with decreasing photon energy of 
the ionizaing laser from 2.53 eV (top) to 2.33 eV (bottom). The values of n at 
the steps in the mass spectra have been indicated (Ref. [15]). 



In other experiments [9] the closing of small subshells of angular momen- 
tum was shown to be accompanied by a sharp step in the ionization energy 
for Cs-0 clusters having certain sizes, namely for Cs„+202 with n = 8, 18, 
20, 34, 58 and 92. The closing at n = 40 seen in all other alkali-metal 
clusters could not be observed, neither in the experiments nor in the calcu- 
lations. The steps were observed for clusters containing from one to seven 
oxygen atoms. 

5 Density functional calculation 

Self-consistent calculations have been carried out applying the density func- 
tional approach to the spherical jellium model [10,11]. We used an exchange 
correlation term of the Gunnarsson-Lundqvist form and a jellium density 
rs = 5.75 corresponding to the bulk value of cesium. This model implies 
two improvements over the hard sphere model discussed earlier. Firstly, 
electron-electron interaction is included. Secondly, the jellium is regarded 
to be a more realistic simplification of the positive ion background than the 
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Fig. 8. Expanded mass spectra of Cs„+2z(S02) clusters for an ionizing photon 
energy of 2.48 eV. The lines connect mass peaks of clusters containing the same 
number 2 of SO2 molecules. Notice that the steps for clusters containing (SO2) 
and (802)2 are shifted by two Cs atoms (Ref. [ 15 ]). 



hard sphere. The ion is taken into account only by omitting the cesium 
electrons presumably bound to oxygen. The calculations were performed 
on Cseoo clusters [25]. 

We found, that if a homogeneous jellium was used, the grouping of 
subshells was rather similar to the results of the infinite spherical potential 
well. However, a nonuniform jellium yielded a shell structure in better 
accordance to experimental results. We found that the subshells group fairly 
well into the observed shells only if the background charge distribution is 
slightly concentrated in the central region. This was achieved, for example, 
by adding a weak Gaussian (0.5% total charge density, half-width of 6 a.u.) 
charge distribtuion to the uniform distribution (width 48 a.u.). Figure 9 
shows the ordering of subshells obtained from this potential. This leads to 
the rather surprising result that the Cs’*' cores seem to have higher density 
in the neighborhood of the center perhaps due to the existence of the 
ion. All attempts to lower the positive charge density in the central region 
led to an incorrect ordering of states. 

The first calculation addressed the problem of the grouping of low-lying 
energy levels in one large Csgoo cluster. However, in the experiment the 
magic numbers were found by a rough examination of ionization potentials 
of the whole distribution of cluster sizes. A more direct way to explain 
magic numbers is to look for steps in the ionization potential curve of Cs-0 
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Fig. 9. The self-consistent, one-electron states of a 600 electron cesium cluster 
calculated using a modified spherical jellium background (Ref. [25]). 



clusters. Therefore, we calculated the ionization potentials of Cs„+20 For 
n < 600 and of (Na)„ for n < 1100 using the same local-density scheme de- 
scribed above, Figure 10. Starting from a known closed-shell configuration 
for n = 18, electrons were successively added. Three test configurations 
were calculated for each cluster size testing the opening of new subshells. 
The configuration with minimum total energy was chosen for the calculation 
of the ionization potential. 

We found that the lower magic numbers n = 34, 58, 92 were well repro- 
duced. For higher n distinct steps in the ionization potential were observed 
for n = 138, 196, 268, 338, 440, 562, 704, 854 and 1012. The absolute 
values of the calculated ionization potentials can be brought into better 
agreement with experiment by assuming that clusters have a 10-15% lower 
electron density than is fouond in the bulk. Magic number clusters exhibit 
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Fig. 10. Ionization potentials calculated as a function of n for (Na)„ clusters. 
A positive background charge distribution slightly concentrated in the central 
region has been used. Notice the similar behavior of the ionization energies of the 
chemical elements (inset) (Ref. [25]). 



unusually high ionization energies for the same reason rare gas atoms do: 
they possess a closed shell electronic configuration, Figure 10. In this sense 
the metallic clusters behave like giant atoms. 

6 Observation of supershells 

Although nuclear physicists speculated on the possible existence of super- 
shells several decades ago, the phenomenon has never been observed in 
atomic nuclei for a very simple reason. No nucleus contains enough fermions 
to allow supershell formation. However, there is almost no limit to the num- 
ber of electrons that can be contained in metal clusters. 

Supershells are the periodic appearance and disappearance of shell struc- 
ture in the energy density of states of a fermion system. In order to make 
clear the physical origin of supershells, it is necessary to go back one step 
to the semiclassical description of shells. Shells are associated with a char- 
actristic length. Each time an integral number of fermi wavelengths fit into 
this length, a new shell has formed. The systems that we are studying are 
so large, that the classical picture of an electron bouncing back and forth 
inside a metal cluster is not completely without meaning. The character- 
istic length associated with a set of shells is just the length of a closed 




16 



Atomic Clusters and Nanoparticles 




Fig. 11. Mass spectrum of (Na)„ clusters photoionized with 3.02 eV photons. 
Two sequences of structures are observed at equally spaced intervals on the 
scale - an electronic shell sequence and a structural shell sequence. 



electron trajectory within the clusters. For spheres, two closed trajectories 
with almost the same length turn out to be the most important - a tri- 
angualr path and a square path. This leads to two sets with nearly the 
same energy spacing. These two contributions interfere with one another to 
produce a beat pattern known as quantal supershells. The first attempts to 
observe supershell structure in our laboratory were hindered by the unex- 
pected appearance of a second set of shells in clusters containing more than 
1500 atoms, Figure 11. 

These proved to be geometric shells of atoms that masked the weaker 
electronic shell structure. In the new experiments the geometric shell struc- 
ture was surpressed by “melting” the clusters through heating with a con- 
tinuous laser beam tuned to the plasmon frequency of the electron system. 

The clusters wer warmed prior to ionization with a continuous Ar-ion 
laser beam running parallel to the neutral cluster beam. The laser light 
entred the ionization chamber through a heated window, passed through the 
ionization volume, through 2.2 mm (j) and 3.0 mm (j) skimmer aperatures. 
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Fig. 12. Mass spectrum of (Na)„ clusters using 4.0 eV ionizing light. The top 
spectrum shows the size distribution of cold clusters produced in the source; the 
bottom spectrum, after heating with 2.54 eV laser light. 



through a 3.0 mm nozzle, through! he oven chamber and finally exited 
through a second window where the laser intensity was recorded. Short 
wavelength light was found to warm much more efficiently. Using the 458 nm 
(2.71 eV) laser line, 10 mW proved sufficient to appreciable alter the neutral 
size distribution. 

The size distribution obtained with ionizing photons having energy well 
above threshold are quite different from the spectra discussed in Section 5 
Without the warming laser the mass spectra are without structure, z.e. the 
size distribution of the cold clusters emerging from our source is smooth, 
Figure 12. If the warming laser is turned on we obtain not steps but peaks 
as seen in the bottom of Figure 12. We believe these peaks reflect the 
neutral size distribution of the laser- warmed clusters. It appears that it is 
usually possible to correlate a falling edge of the size disbribution with a 
step in the threshold ionization spectrum. Because of this correlation, we 
will characterize mass spectra obtained using excimer light by the number 
of atoms at steep negative slopes. A more extended mass spectrum of laser- 
warmed sodium clusters obtained with 4.0 eV ionizing photons is shown 
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Fig. 13. Mass spectrum of (Na)„ clusters using 4.0 eV ionizing light and (458 nm) 
2.71 eV continuous axial warming light having an intensity of 500 mW/cm“^. The 
spectrum has been smoothed over one- hundred 16 ns time channels (top). In order 
to emphasize the shell structure an envelope function (obtained by smoothing over 
20 000 time channels) is subtracted from a structural mass spectrum (smoothed 
over 1500 time channels). The difference is shown in the bottom spectrum. 



at the top of Figure 13. This spectrum has been smoothed with a spline 
function extending over one-hundred 16ns time channels. Notice that the 
structure observed does not occur at equal intrvals on a scale linear in mass. 
In order to present this structure in a form more convenient for analysis, 
the data have been procesed in the following way. First, the raw data 
is averaged with a spline function extending over 20 000 time channels. 
The result is a smooth envelope curve containing no structure. Second, the 
raw data is averaged with a spline over 1500 channels. Finally, the two 
avearges are subtracted. The result is shown in the bottom of Figure 13. 
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Five independent measurements were made under the same experimental 
conditions. The positions, relative heights and widths of features in the 
mass spectra were well reproducible. 

The clusters in this experiment have been warmed with a continuous 
laser beam running parallel to the neutral cluster beam. But what is implied 
by “warming”? consider the fate of a typical 500-atom cluster as it moves 
from the nozzle to the detector. 

It leaves the nozzle with the temperature of the He carrier gas (~100 K) 
traveling at a velocity of about 350 m/s. during its 1 ms flight to the ion- 
ization volume it undergoes no further collisions but does begint o absorb 
photons. We do not really know the absorption cross-section of this cluster 
at the warming laser wavelength (458 nm). However, 1 A^/atom is a typical 
upper limit for smaller clusters. It can be expected that the cross-section 
will be cluster size dependent. This size dependence will be reflected in the 
final mass distribution. The cluster absorbs the first 25 photons without 
evaporating any atoms, gaining an excess energy of about 70 eV and reach- 
ing a temperature of about 500 K. This all takes place in the first 450 /iS. 
The temperture of the cluster remains rather constant for the last half of 
its journey to the ionization volume. It continues to absorb photons, of 
course, but after each absorption it evaporates 2 or 3 atoms returning to 
its original temperature before absorbing the next photon. It loses a total 
of 80 atoms, i.e. 16% of its original mass. It appears that this repeated 
heating and cooling through the “critical temperature for evaporation” on 
this time scale favors the evolution of a size distribution with relataively 
strong peaks near sizes corresponding to closed electronic shells. 

The photon energy (4.0 eV) of the ionizing laser has been chosen so 
that it is well above the ionization threshold (3.0 eV) of the sodium clus- 
ters investigated. The excess energy (1 eV) insufficient to cause only one 
atom to evaporate. This is a neglibible loss on the mass scale we will be 
considering. For this reason, we believe that the magic numbers obtained 
reflect variations inthe size disbtribution of the neutral clusters induced by 
the warming laser. 

The concept of shells can be associated with a characteristic length. 
Every time the radius of a growing cluster increases by one unit of this 
characteristic length, a new shell is said to be added. A good rough test of 
whether or not shell structure has been observed can be quickly carried out 
by plotting the shell index as a function of the radius or . If the points 
fall on a straight line, the data is consistent with shell formation. That this 
is indeed the here, can be seen in Figure 14. However, an even better fit 
can be obtained using two straight lines with a break between shell 13 and 
14. This too can be interpreted in an interesting way. 
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Fig. 14. The electronic shell closing falls approximately on a straight line if plotted 
on an scale. An even better fit is obtained using two straight lines with a 
break between shells 13 and 14. Such a break or “phase change” would be an 
indication of supershell structure. 



It has been suggested [19-28] that shell structure might periodically ap- 
pear and disappear with increasing cluster size. Such a supershell structure 
can be understood as a beating pattern created by the inteference of two 
nearly equal periodic contributions. Quantum mechanically the contribu- 
tions can be described as arising from competing energy quantum numbers. 
Classically, the contributions can be described as arising from two closed 
electron trajectories within a spherical cavity. One trajectory is triangular, 
the other square. 

7 Fission 



The fission of clusters was one of the first subjects [29-43] to be investigated 
in the newly developing field of cluster research. It is often referred to as 
Coulomb explosion, since the fission is caused by the Coulomb repulsion of 
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like charges concentrated in a cluster smaller than a critical size. The kinetic 
energy that the charged fragments acquire can be as high as several eV. 
Most of these studies have delt with the fission of doubly or triply charged 
clusters. Recently we have shown that it is possible to induce charges as 
high as +14 on large Na clusters by photoionization [44]. In this section we 
will discuss fission in these highly charged clusters. 

The technique we have used to study fission in sodium clusters is pho- 
toionization time-of-flight (TOF) mass spectroscopy. The cluster source is 
a low pressure, inert gas, condensation cell. The clusters were photoionized 
with a 50 mJ, 15 ns, 193 nm (6.4 eV) excimer laser pulse focussed onto the 
neutral cluster beam with a 150 cm focal length quartz lens. The ionized 
clusters were heated 30 ns later with a second 5 mJ/mm^, 470 nm (2.6 eV) 
laser pulse. 

The energy (/) required to remove an aditional electron from a cluster 
that already has charge +z can be written 

I{z,R) = W + {a + z)e^/r (7.1) 

where W is the bulk work function, e is the electronic charge, and R is 
the radius of the cluster. Clearly, we have assumed that the cluster can be 
modelled as a conducting sphere. Various values of a have beenused in the 
literature. We will assume a is 0.5 and point out that for larg values of z 
the value of a used becomes unimportant. Since the radius of the cluster 
can be related to the number of electrons (or in our case atoms) through 
the Wigner-Seitz radius, R? = r^n, equation (4) can be rewritten as 

I{z,n) = W + {a + z)e^ /rsv}/^). (7.2) 

It can be seen from this expression that a larage amount of energy is required 
to remove electrons from small, highly charged clusters. If the amount of 
energy available is limited to that in one photon, then the maximum charge 
attainable for a cluster of a given size is 

2max = 1 - a+ (/ijz- TT)rsn^/^/e^. (7.3) 

This means that a two-dimensional cluster space (n, z) can be divided by 
a line into clusters that can be formed with, for example, an ArF excimer 
laser and those that cannot. Figure 15. Also indicated in this figure is a 
line dividing the space into stable and unstable clusters. Notice that all of 
the values of n and z accessible with the ArF photons characterize stable 
clusters. The unstable clusters which we would like to investigate cannot be 
produced by direct multi-step ionization with this laser. There is, however, 
a way out of this dilemma. 
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Fig. 15. The two-dimensional cluster space (n, z) can be divided by straight lines 
into Na^ clusters which can (cannot) be produced with an ArF laser and into 
clusters which are (are not) stable against Coulomb explosion. Notice that all 
clustrs (except for z = 2) that can be ionized with 6.4 eV photons are stable. 



The ArF laser can be used to prepare a stable, highly charged, large 
cluster and then this large cluster can be reduced in size by heating and 
subsequent evaporation. A second laser pulse, containing photons with 
energies near the plasmon resonance of the sodium clusters, is used for 
heating. The clusters shrink down in size without charge until they reach 
a critical size at which they undergo fission. A mass spectrum, or better 
said, an nj z spectrum for Na)( clusters produced in this way is shown in 
Figure 16. The log scale emphasizes, perhaps even overemphasizes, the 
effect we wish to show. 

The highest set of mass peaks belongs to singly-charged sodium clusters. 
The peaks which occur exactly half-way between the Na+ peaks are due to 
Na^+ clusters. Notice that new sets of peaks appear in the spectrum at 
various critical values of nj z. This is perceived as a step-wise darkening 
of the mass spectrum. In the lower part of Figure 16 we see the threshold 
region for the appearance of Na^"*" on an expanded scale. Another segment 
of the spectrum on an expanded scale is shown in Figure 17. This segment 
is near the threshold for the appearance of Na®“*' on. 

In this way, by careful examination of the fine structure in the mass 
spectra, it is possible to determine that the critical sizes for 2 = 1, 2, 
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Fig. 16. An niz spectrum of Na^ clusters, large clusters were first charged by 
multistep ionization using a high-fiuence, arF laser. The clusters were then heated 
to reduce their size by evaporation. A portion of the spectrum is expanded to show 
the appearance threshold for Na®"*" clusters (black filled). 



3, 4, 5, 6 and 7 are 27 ± 1, 64 ± 1, 123 ± 2, 208 ± 5, 321 ± 5 and 
448 ±10 atoms, respectively. These values are plotted on a double log scale 
in Figure 18. They lie on a straight line with slope 2. This means that the 
critical condition for stability is 

z^jn < 0.125. (7.4) 

In is proportional to the so-called fissility parameter used in nuclear phy- 
ciscs as a measure of stability. It has been shown inthe past that this 
parameter is also useful for clusters with small total charge [40,41]. Here 
we see that it continues to be applicable for values of z up to 7. 
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Fig. 17. An expanded portion of Figure 16 showing the first appearance on Na®^ 
clusters. Smaller clusters in this charge state are not stable. 



The results of an extensive theoretical investigation of fission in Na clus- 
ters have recently been published [45,46]. an important assumption made 
inthis work was that the fission is symmetric, i.t. the mass and charge 
of the original cluster are divided nearly equally between the fission pro- 
ductes. Unfortunately, we have no evidence at this time to either support 
or to challenge this assumption. Still, it is useful to compare the results of 
this calculation with our experiment. Figure 18. 

Here, cluster space (n,z) has been divided into stable metastable and 
unstable regions according to the tunneling criteria appropriate in nuclear 
physics. The fission process for nuclei can be described qualitatively in 
terms of three energies; the initial energy (Aj) of the charged, nondeformed 
clusters, the final energy (E{) which is the sum of the energies of the non- 
interacting fission products. The third energy necessary to characterize 
fission products. The third energy necessary to characterize fission is the 
energy (Ab) of the lowest barrier separating the initial and final states. 
If Eh < Ei, the nucleus is unstable. If Ef < E^ < Eh, the nucleus is 
metastable to fission by tunneling through the barrier. Finally, if Ef > E,, 
then the nucleus is stable, since tunneling for clusters has negligible prob- 
ability, it is more accurate to say at zero temperature clusters are either 
stable or unstable, depending on whether there is a barrier or not. At finite 
temperature clusters can be classified as either unstable (Ab < E{) or as 
metastable (Ab > E\) to thermal hopping over the barrier. In practice it is 
useful to further subdivide the set of metastable clusters [36-38]. At finite 





T.P. Martin: Experimental Aspects of Metal Clusters 



25 




1 2 4 6 8 10 15 20 

z, Charge (a. u.) 

Fig. 18. The number of atoms in the smallest experimentally observed Na^ clus- 
ters (filled circles). The cluster space can be divided into stable, metastable and 
nnstable regions, according to [45], using the symmetric liqnid-drop model. 



temperatures clusters can lose mass and thermal energy by the evaporation 
of neutral atoms. Evaporation will always compete with fission and will, in 
fact, dominate if Eb is greater than E^, the energy needed to evaporate an 
atom. For this reason, we have the conditions 

a) unstable to fission, Eb < Ei] 

b) metastable to fission, Ei < Eb < Ey] 

c) metastable to evaporation, E[ < E^ < Eb- 



Since previous experiments [36-38] on doubly-charged Na and K clusters 
indicate that fission is strongly asymmetric, it would be appropriate now 
to consider this alternative. Even using the droplet model it is not easy to 
calculate the height of the barrier if the mass and charge can be distributed 
arbitrarily between the fission products. For this reason we will consider an 
energy which does not exactly characterize a real system, but it is trivial to 
calculate and therefore useful. It is the energy at the instant of scission. Eg. 
That is, starting from the final state we merely bring the fission products 
together until they just touch. Of course the energy increases monotonically 
from Ef to Eg according to Coulomb’s Law. We assume that Eg is nearly 
equal to the barrier height for the fission process. There is no unique value 
of Eg for a cluster in initial state n and z. Rather, a whole set of values exist 
corresponding to the various ways of distributing charge and mass between 
the fission products. However, one value of Eg has special significance and 
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z, Charge (a. u.) 

Fig. 19. Cluster space divided into evaporative and metastable und unstable 
regions. The barrier height has been obtained using an oversimplified model (see 
text). The number of atoms in the smallest experimentally observed Na^”*" clusters 
is shown by the filled circles. 



that is the minimum value. If this minimum value of < Ej the cluster 
is unstable and will spontaneously fission, even at zero temperature. If 
Es > Ei the cluster is stable against fission. Strictly speaking, one shuld 
say metastable because at finite temperatures the final state can be reached 
by jumping over the barrier, no matter how high. The results of these 
calculations are summarized in Figure 19. The initial and final energies are 
determined using the sphereical droplet model assuming only two fission 
fragments with arbitrary size and charge and assuming a surface tension 
parameter cr = 200 dyne/cm appropriate for sodium. One might expect that 
the experimental points would fall on the line corresponding to Eb = E^. 
Clearly, this is not the case since E^ is known [36-38] to have a value of 
about 1 eV. That is, this rough model overestimates the barrier height by 
a factor of two. Various refinements are clearly needed; proper treatment 
of the Coulomb energy allowing for electron redistribution as the fragments 
move away from one another [48], a description of the asymmetric fission 
before scission, shell effects [49, 50] and entropy effects. Also needed are 
experiments demonstratinghow mass and charge are distributed between 
the fission fragments. 

8 Concluding remarks 

Clearly, cluster science has greatly benefitted from the inspired work carried 
out by nuclear physicists decades ago. The shell model, the liquid droplet 
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model, and the theory of giant dipole resonances have provided a ready and 
appropriate framework for understanding the properties of metal clusters. 
Hopefully, in the future, the exchange between nuclear science and cluster 
science will not be so one-sided, because metal clusters offer us a unique 
opportunity to study well-characterized, large fermion systems. 
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Abstract 

An experiment is described which allows to measure the caloric curve 
of size selected sodium cluster ions. This allows to determine rather 
easily the melting temperatures, and latent heats in the size range 
between 55 and 340 atoms per cluster. A more detailed analysis is 
necessary to show that the cluster NaJj^ has a negative microcanon- 
ical heat capacity, and how to determine the entropy of the cluster 
from the data. 

1 Introduction 

Melting in a macroscopic object occurs at some well defined temperature. 
This is no longer true for a small particle or cluster. If one talks about 
melting one must first verify whether the concepts of solid or liquid can be 
transferred to small systems. The answer is yes: at low temperatures the 
atoms in a cluster or in a large piece of matter make only small amplitude 
vibrations around a fixed position. It takes a lot of energy to push an atom 
from its position and one has a solid. If the temperature increases, atoms in 
the cluster can visit neighbouring places and start a diffusive motion. There 
is very little resistance to external deformation forces and the cluster is 
liquid. This is demonstrated in Figure 1 which shows a simulation for a cold, 
rigid and a liquid, hot cluster. The simulation was done with the potential 
parameters for argon. But the main features of small harmonic oscillations 
at low and more diffusive motion at high temperatures are independent of 
the force field assumed. 

Concepts like temperature, phase etc. had originally been defined only 
for infinitely large systems. But one has learned how to generalize these con- 
cepts to finite particles, although there are still some more semantic quarrels 
about nomenclature. For the solid to liquid transition of finite systems one 



The work was supported by the Deutsche Forschungsgemeinschaft through SFB 276. 



© EDP Sciences, Springer- Verlag 2001 




32 Atomic Clusters and Nanoparticles 




Fig. 1. A simulation of a cluster of 147 atoms at T = 0 K (left figure) and a high 
temperature. The darker the colour, the more strongly an atom is bound. The 
calculation was made using a Lennard-Jones potential, which is not adequate for 
the sodium clusters discussed in this report. So one does not know whether the low 
temperature structure of NaJj^ really looks like this icosahedron, whose regular, 
five fold symmetry can be well seen here. The general features of the solid to liquid 
transition however, should be independent of the force held assumed. (Calculation 
by Moseler, colour pictures on the WEB site of the author.) 



finds four main differences with respect to the bulk counterpart, 1) the 
melting point is generally reduced (Sect. 4.1), 2) the latent heat is smaller 
(Sect. 4.2), and 3) the transition does not occur at one definite temperature 
but is spread out over a temperature range. The fourth difference is quite 
spectacular: the heat capacity can become negative, i.e. the temperature 
of the system can decrease upon energy addition (Sect. 6.1). 



Experimentally, the depression of the melting point has been shown [1] as 
early as 1908. Many experiments have been done with particles on surfaces 
which contained thousands or millions of atoms [2]. Also, very many com- 
puter simulations have been performed to get a better understanding [3-6]. 
They have shown how the bulk concepts like temperature and melting can 
be generalized to finite systems. Nearly all simulations have been performed 
for free clusters of known size, while most earlier experiments have been per- 
formed with particles on surfaces having a rather broad size distribution. 
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2 Cluster calorimetry 

All experiments to study the melting point of bulk material or of clusters 
can be divided into two classes: 

1. one studies the change of some physical property across the melting 
point, e.g. change in photon or X-ray diffraction patterns, or mobility; 

2. thecaloric curve E = E{T) is measured, that is the clusters internal 
energy E as a, function of its temperature T. This curve contains all 
the thermodynamic information, as discussed below. 

The experiment discussed here, is of the second type. It measures the caloric 
curve E = E(T) for free sodium cluster ions, where the number of atoms is 
exactly known. 




Fig. 2. Calorimeter (left), caloric curve E = E{T) and heat capacity C(T) for 
a macroscopic sample. The caloric curve increases abruptly at the sharp melting 
temperature (Tmeit)- The height of the jump is given by the latent heat q of 
melting. The heat capacity has a delta function like peak at Tmeit (see Eq. (2.1)). 



2.1 The bulk limit 

It is often very useful in cluster science to study the asymptotic behaviour, 
that is that of the atom and that of the bulk. Although there is a well 
understood bulk limit, the atom and the diatomic molecule do obviously 
not have a melting point. It seems that at least seven atoms are needed 
before something like a liquid behaviour is seen in numerical simulations. 

It is in principle easy to measure the caloric curve for bulk materials. 
One puts the material into a thermally isolated container (see Fig. 2), adds 
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energy E and measures the temperature T . After compensation for the 
thermal mass of the container, one can construct the caloric curve, T = 
T{E), or E = E{T). 

For a large system, the caloric curve exhibits a step at the melting 
temperature Tmeit- The height of the step is the latent heat q. Energy 
can be added to the system at Tmeit without an increase of temperature; 
all the energy is used to destroy the regular solid structure, and thus in- 
crease the entropy. This fact is widely used when drinks are cooled by ice 
cubes. The large latent heat of ice makes it an efficient coolant. Only when 
the system has become completely liquid, the temperature will rise again 
upon addition of energy. Note, that one has phase separation for a large 
sample, i.e. water and ice coexist together at Tmeit- As discussed below 
(Sect. 6.1), phase separation is not possible for a sufficiently small system. 
The derivative of the caloric curve is called the heat capacity C'(T) or also 
the specific heat [7]: 



C{E) 



dE{T) 

dT 



( 2 . 1 ) 



It has a delta function like peak at the melting temperature for a bulk 
system. 

It is possible today to construct “nano-calorimeters” using the tech- 
niques of the semiconductor chip industry, which carry the principle of 
Figure 2 to very small particles. In the year 2000 the lower limit of this 
technique is a particle size of about 1000 atoms [8]. Also electron and 
X-ray diffraction has been used to study clusters deposited on surfaces [8]. 
At present these methods are however not applicable to free, mass selected 
clusters in vacuum for two reasons: first no method of temperature mea- 
surement is known in this case, and also the density of mass selected clus- 
ters is so small, that it is near to impossible to collect a diffraction signal. 
Earlier attempts to circumvent this problem can be found in [9-13]. An 
experiment on electron diffraction on stored cluster ions has recently been 
published [32] , which opens the possibility of doing a type 1 experiment on 
stored, thermalized ions. 



2.2 Calorimetry for free clusters 

The thermal experiments on free, mass selected clusters have only been 
performed with sodium so far. Sodium has been chosen for two reasons: 
1) It was the aim of this experiment to work with mass selected clusters, 
i.e. the number of atoms in the cluster is exactly known. In this case one 
has to use a mass spectrometer, which forces one to work with ions. For 
non-metallic elements there can be quite a difference between the electronic 
and geometric structure of a neutral and a positively charged cluster [33]. 
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Due to the delocalized nature of the chemical bond this is not the case in 
metals. 2) From all the metals available, sodium is the easiest to calculate. 

The cluster calorimetry [14-19] method developed in our group does the 
inverse of the bulk method shown in Figures 2 and 3. Size selected clusters 
of known temperature are prepared and their energy (more exactly their 
energy distribution, averaged over an experimental broadening) is measured. 
The method consists essentially of two steps, as indicated in Figure 4. 

Step 1 Preparation of size selected clusters of known temperature. 
Cluster ions are produced and thermalized in a heat bath. A mass 
spectrometer is used to select a single cluster size. This prepares 
clusters of known size and known temperature T. 

Step 2 Determination of their energy. The clusters are irradiated by a 
laser beam and absorb several photons. This leads to the evaporation 
of atoms. A second mass spectrometer measures the distribution of 
the fragment ions produced, which has a characteristic shape as shown 



Fig. 3. One would like to make an 
experiment as shown here, which 
would be the direct generalization 
of the setup in Figure 2 to free clus- 
ters. Energy is supplied, here by 
a candle, in an experiment more 
probably by a laser. The tempera- 
ture is measured by a thermometer, 
which could be a small molecule, 
whose states are interrogated by 
another laser. This method of 
temperature measurement has been 
studied but no reliable temperature 
scale could be established so far. 
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Production of cluster ions of known Energy determination 
mass and temperature 

Fig. 4. Schematic of the experiment. Cluster ions are produced and thermalized 
in a heat bath of known temperature T. A first time of flight mass spectrometer 
(TOF 1) selects one cluster size. In this way, a cluster of known temperature and 
known mass is produced (see Step 1 above). The clusters absorb several photons 
from a pulsed laser, and the distribution of the charged fragments is measured 
by TOF 2. From these data the energy E before irradiation can be determined, 
as explained in Figure 5. When energy and temperature are known, the caloric 
curve E = E{T) can be constructed (Step 2 above). 



in Figure 5. Different numbers of absorbed photons lead to clearly 
separated groups of fragments in the mass spectrum, with the distance 
between two groups corresponding to exactly one photon energy. This 
allows one to calibrate the mass scale in terms of energy [14-17]. 



If the temperature of the heat bath is varied, the inner energy of the se- 
lected cluster changes and thus also the number of evaporated atoms. The 
fragment groups shift on the mass scale as shown in the inset of Figure 5. 
These twodimensional plots are the primary data of the experiment. From 
them the caloric curve E = E(T) can be determined directly as indicated in 
the figure. This method of data treatment has several advantages compared 
to the one used earlier [14-16]; it is more robust, and avoids the averaging 
inherent in the use of the finite difference approximation to equation (2.1) 
used earlier (Eq. (4) of [14]). 



3 Experiment 

The schematic of the experiment is shown in Figure 4. In more detail, two 
Time-Of-Flight (TOF) mass spectrometers of the refiectron type are used 
in tandem. The detector is an Even cup [20] employing up to 30 kV post 
acceleration for efficient detection of the heavier clusters. The fragmentation 
laser is an excimer pumped dye laser. The experimental setup is similar to 
those used by other groups. The novel and at the same time crucial part of 
the experiment is the source for thermalized cluster ions. 



H. Haberland: Cluster Melting 



37 




number of atoms evaporated 



Fig. 5. The energy F of a free cluster can be obtained from its photofragmentation 
mass spectrum. In the experiment, a NaJj^ cluster is prepared at a temperature of 
240 K. After absorption of several photons the cluster starts to emit neutral atoms. 
The intensity of the remaining charged fragments is plotted against the number of 
evaporated atoms (lower panel). Without laser one observes only the large peak 
on the left, corresponding to the intact Na)^ 47 . four absorbed photons one 
has an approximately Gaussian distribution of photofragments centered around 
nine evaporated atoms. For five (six) absorbed photons the Gaussian is centered 
near thirteen (seventeen) ejected atoms. The distance between the maxima of 
the Gaussians corresponds to exactly one photon energy, which allows an energy 
calibration of the mass scale. As for all calorimeters, energy calibration is only 
good up to an additional constant. Inset: the mass peaks are connected by a 
smooth function, and plotted against the temperature of the thermalization cell 
of Figures 4 and 6. White implies high intensity. The caloric curve is obtained by 
connecting the maxima; one example is shown. The other maxima are just shifted 
by the photon energy. The melting temperature is indicated at the left and the 
latent heat q at the top. Note, that the lower panel is for 240 K, while the inset 
is for 180 to 330 K. 
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3.1 The source for thermalized cluster ions 

The source is shown in Figure 6. It consist of two main parts, the cluster 




Fig. 6. The gas aggregation source used to produce thermalized cluster ions. 
Sodium is evaporated from a small, tilted container into a cold stream of He gas. 
A gas discharge (not shown) is ignited and produces charged species. Aggregation 
is very effective up to diaphragm Dl. The cluster formation stops effectively after 
D1 due to the low density of sodium atoms there. The helium gas transports the 
clusters from the source region (left) through Dl into the thermalization region. 
There they make 10® to 10® collisions with He atoms, whose temperature T can 
be adjusted by a heater/cooler. The cluster acquire a canonical distribution of 
energies at the temperature T. 

source itself (left part) and the thermalization stage. In the source, sodium is 
evaporated into a cold, slowly streaming helium gas of about 70 Pa. Mostly 
sodium atoms and a few dimers leave the boiling liquid, which aggregate in 
the cold He gas to clusters. The He gas can be added at two points, one 
at the far side of the aggregation tube, or it can be directly blown into the 
sodium container. One can optimize the intensity by varying the relative 
gas flow through the two inlets. A weak electric gas discharge (~100 V, 
about 1 mA) is ignited with the sodium container working as a hollow 
cathode. This leads to an efficient production of positively charged sodium 
cluster ions. Probably, charged molecular dimers, like NaJ or HeJ act as 
condensation germs. Once a dimer is built, cluster formation starts rapidly. 
The process is similar to cloud formation in nature and will not be discussed 
here. 

The tube containing the aggregation zone has a diameter of about 10 cm. 
It is cooled by liquid nitrogen, which facilitates the production of smaller 
clusters, and also leads to a more stable operation. The clusters together 
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with the helium gas leave the aggregation zone through a diaphragm (Dl) 
of variable diameter, which is an iris aperture normally used in optics. The 
clusters enter a copper tube, which can be cooled down to 30 K and heated 
up to 600 K. They leave this tube through a second variable iris (D2). Most 
of the helium is pumped away, and the clusters continue to fly through 
a skimmer into the first mass spectrometer. Both irises and the skimmer 
are electrically isolated and carry a small potential (less than 10 V) to 
compensate for surface charges and plasma effects. 

The setup with the two adjustable irises is mechanically somewhat com- 
plicated, but at least the diameter of iris D2 must be adjustable during 
the experiment if one wants to scan over a sufficiently large temperature 
range. The viscosity of a gas scales as and the gas flux through small 

holes thus diminishes with increasing temperature. The cluster intensity is 
a sensitive function of the gas flux, and typically every 30 K the otherwise 
automatic measuring process has to be interrupted in order to optimize the 
beam intensity by adjusting iris and gas flow. 

4 Caloric curves 

The primary results of the experiment are two-dimensional data fields as 
shown in Figure 5. One obtains the caloric curve by connecting the maxima 
of the fragment distributions. Figure 7 shows an example. A more detailed 
analysis of what is actually measured is given below (Sect. 5.3). 

At low temperatures, the caloric curve increases roughly linearly, and 
then shows a change in slope which is due to the melting process. Near the 
melting temperature a relative large amount of energy is needed for a small 
increase of temperature. The sudden jump of bulk systems is smoothed to 
a finite width. The melting temperature is indicated. The latent heat q is 
equal to the increase in the caloric curve near Tmeit- For higher tempera- 
tures, a nearly linear increase is again observed. The following data can be 
read off Figure 7: 1) the melting temperature Tmeit as given in Figure 8, 
2) the latent heat given in Figure 9, and 3) more generally, the specific 
heat as a function of temperature. In the temperature range studied, the 
heat capacity (the derivative of the caloric curve with respect to the tem- 
perature) is nearly constant below and above Tmeit- It is generally higher 
above Tmeit- For smaller cluster sizes, melting temperature and latent heat 
diminish. We have measured several caloric curves, (e.^. Na^p and Na^g) 
where there seems to be only a bend (and not a smoothed out step) in the 
caloric curves. Data from these curves are not included below. 

At a temperature above 400 K, the clusters are so hot that they can 
evaporate atoms without having absorbed a photon. Thus they are no 
longer thermalized when they arrive in the laser interaction zone. Therefore 
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Fig. 7. Caloric curve for NaJjg. Below 100 K the curve has been extrapolated 
using the bulk result. The soft increase near 260 K is due to the melting process. 
Melting temperature and latent heat can be easily read off the curve. Above 
400 K the data deviate from the caloric curve as the cluster can no longer be 
thermalized. 



the very high temperature part of the curve does not belong to the caloric 
curve, but describes an evaporative ensemble [21], whose temperature and 
energy can be read off Figure 7. 

From the discussion below (Sect. 5.3), it will become apparent that 
Figure 7 does show for temperatures below 400 K a microcanonical caloric 
curve, but averaged over the unavoidable broadening due to the photofrag- 
mentation process. Therefore a possible backbending cannot be seen, and 
the curve looks like a canonical one. Outside T„ieit± 20 K canonical and 
micro-canonical curves agree within experimental error. 



4.1 Melting temperatures 

The temperature corresponding to the of largest slope of the caloric curve 
is identified as the melting point. The data available so far are shown 
in Figure 8. The melting points show surprisingly large variations: one 
additional atom can change Tmeit by up to 10 K. Two main points can be 
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number of sodium atoms 



Fig. 8. The melting temperatures of Na^ are plotted against the number of 
atoms. Large fluctuations are seen whose origin is not well understood. They 
do not correlate with electronic (dotted line) or geometric (dashed) shell closings, 
which are indicated in the figure. The bulk melting temperature is 371 K. Note, 
that the zero is suppressed. 



observed: 1) the melting temperatures are about one third lower than in 
the bulk, and 2) they fluctuate by ±50 K. From less complete data it had 
earlier been conjectured that the melting points are high, if electronic and 
geometric shell closings are close to each other. This is no longer supported 
by the new data covering a wider mass range. 

Several calculations [6,22-25] have been performed on melting of sodium 
clusters, but the overall structure of the data in Figure 8 has not been repro- 
duced so far. The number of atoms for the maxima and minima in Figure 8 
do not generally correspond to any known shell closings, be they electronic 
or atomic in origin. Also, it might be that the cluster changes its geometry 
near the melting temperature, as has been observed in a simulation on gold 
clusters [5]. 

Many calculated melting temperatures for a variety of cluster sizes with 
less than 30 atoms show a surprisingly simple scaling with the parameter 
S = Flint /if sur, where Flint and Egur are the mean potential energies of 
the internal atoms and the surface atoms, respectively [26]. It would be 
interesting to see if the larger clusters studied here behave similarly. 

For two cluster sizes (70 and 93 atoms) the caloric curve does not show an 
increase like in Figure 7, but only a bend, which can be either interpreted 
as melting with a very small latent heat, or as the lower temperature of 
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Fig. 9. The latent heat per atom is plotted against the number of atoms in the 
cluster. Large fluctuations are visible which correlate with those of the melting 
point. The line through the data is only drawn to guide the eye. The bulk value 
is 27.5 meV. 




an extended melting range. No upper range is found in the data until 
the clusters evaporate at 400 and 410 K, respectively. These data are not 
included in Figure 8. 

4.2 Latent heats 

Classically, the latent heat is the energy to destroy the lattice at the melt- 
ing temperature. Here it is measured as indicated in Figures 5 and 7 as the 
height of the increase of the caloric curve near Tmeit- The vertical scale in 
Figure 7 is energy calibrated, so that the latent heats are obtained abso- 
lutely. The data are more noisy than those in Figure 8, but they seem to 
indicate a correlation of the extrema with those of the melting points, at 
least for the data above n = 100. This correlation is physically plausible 
as the higher the melting point, the more the crystal lattice can withstand 
thermal energy, i.e. the higher the latent heat. 
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4.3 Other experiments measuring thermal properties of free clusters 



There is quite a history of earlier experiments trying to measure the melting 
behaviour of free cluster. The groups of Buck [9], Even/Jortner [10] looked 
for spectroscopic evidence. Electron diffraction from a not mass selected su- 
personic expansion gives Debye-Scherrer like diffraction rings, the intensity 
of them being a measure of cluster temperature. This was pioneered by the 
Farges/Torchet group [27] and later intensively studied Bartell et al. [12]. 

The Martin group [11] were the first to publish a size dependence of the 
melting temperatures of free clusters. They showed that the structure on 
mass spectra of large sodium clusters depends sensitively on the tempera- 
ture, if the photon energy is chosen appropriately. The disappearance of 
the structure was interpreted as being due to melting. 

Another method has been used to measure melting temperatures of free 
clusters. It has been applied to small tin clusters with the surprising result 
that their melting point is higher than the bulk one [28]. In this experiment, 
cluster ions are injected into a helium gas, and are pulled by an electric 
field through the gas. They make very many collisions, which produces an 
effective friction force. Clusters having a small collision cross section make 
fewer collisions, thus experience a smaller friction force and arrive first. 
Small Sn-clusters have a rather elongated structure which should change to 
near spherical upon melting. A change in collision cross section can thus 
be expected upon melting. No signature of melting is observed, so that 
the authors conclude that tin cluster ions containing 10 to 30 atoms have a 
melting point which is at least 50 K above the bulk one [28,29]. 

The experiments discussed above all belong to class 1, as defined in 
the introduction to Chapter 2, i.e. some physical property (optical or mass 
spectrum, diffusion cross section...) is studied as a function of temperature. 
The only other experiment belonging to class 2, has been used to measure 
caloric curves of free tin clusters [30]. These were produced by a laser 
ablation source using a pulsed nozzle whose temperature is variable. The 
authors estimate that nozzle and cluster temperature deviate by 10 to 20 K 
only. Neutral clusters are studied so that no mass selector can be used. 
The distribution of cluster sizes is not given, but similar experiments give a 
5m/m not smaller than 60%. The energy is measured very differently. The 
cluster impinge on a sensitive pyroelectric foil, whose temperature increase 
leads to a measurable voltage jump. The measured caloric curve looks very 
similar to the one given in Figure 7. Only the solid/liquid transition region 
is broader due to the cluster size distribution. The interpretation of the 
experiment has been questioned [31]. 
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5 A closer look at the experiment 

It was discussed above how caloric curves, melting points and latent heats 
can be directly obtained from the temperature dependence of the mass 
spectra. Here we will take a closer look at the experiment and analyse 
it in more detail. This is necessary, first to understand better what kind 
of experiment is actually performed, and second to extract more detailed 
information from the data. 

5.1 Beam preparation 

Cluster ions are produced and thermalized in a heat bath, which gives them 
a canonical distribution of inner energies. For the further discussion, some 
definitions of basic Thermodyanmics are needed, which are briefly reviewed. 

5.1.1 Reminder: Canonical versMS microcanonical ensemble 

In the theory of Thermodynamics one defines several idealized situations, 
called ensembles, which are particular important or easy to calculate [34]. 
Two of them are realized in this experiment. In the canonical ensemble one 
has a collection of physical systems (atoms, molecules, clusters, spins...) 
which have a fixed temperature and a thermal distribution of energies. A 
physical realization are particles in contact with a large heat bath. 

In a microcanonical ensemble on the other hand, the energy is fixed. A 
physical realization is an isolated particle in vacuum, which can neither emit 
nor absorb photons or atoms. Only for an infinitely large system the two 
ensembles give the same result, for a finite system there can be differences 
[4, 6, 35, 36] which will be particular important near phase transitions. 

5.1.2 A canonical distribution of initial energies 

In the thermalizer of Figures 4 and 6 the helium gas is in contact with the 
metal tube whose temperature is kept constant. Helium is a good heat con- 
ductor at the pressure of about 70 Pa. The He atoms are thermalized at 
the tube wall and in turn thermalize the clusters by collisions. The clusters 
make typically 10® to 10® collisions with the gas, more than sufficient for 
complete thermalization. Thus each cluster can be considered as a small sys- 
tem in contact with a large heat bath, and thus corresponds to a canonical 
ensemble. Theory tell us that in this case the small system has a canonical 
distribution of energies Pt{E), which is given by the density of states of the 
cluster U{E) = exp{S'(A)/fcB} times the Boltzmann factor [34]. 

Pt{E) (X n(if) exp{— A/Z cbT} = exp{S (E) / ItB — E/k^T} (5.1) 
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Fig. 10. Photofragmentation spectrum of The intensity of each cluster ion 

is plotted against the number of atoms lost. Without laser one sees only the high 
peak at the left, whose distribution of inner energy Pt{E) is given in the inset. 
The photon energy of hi/ is more than four times the dissociation energy Dn so 
that the mass groups corresponding to a fixed number of absorbed photons are 
clearly separated. The mass peaks shift to the right with increasing temperature 
or photon energy, as indicated. The envelop of the mass peaks, indicated by the 
solid line, is the convolution of the initial thermal distribution Pt{E) with the 
width Pevap(E) due to the fragmentation process, as given by equation (5.6). 

where S{E) = fcBlnn(i?) is the entropy, and fes is Boltzmann’s constant. 
The inset to Figure 10 shows an example for Pt{E); far from the melt- 
ing temperature it is nearly Gaussian, near a phase transition P^{E) can 
become bimodal as seen in Figure 11b. Equation (5.1) is quite power- 
ful, as it gives a relation between temperature, energy and entropy. One 
can, for example, take the measured caloric curve E = E(T), plug it into 
equation (5.1), and obtain entropy and Pt{E). 

5.1.3 Free clusters in vacuum, a microcanonical ensemble 

High vacuum is needed to operate a mass spectrometer, so that the clus- 
ters have to be transferred to a low pressure for mass selection. The aim 
of the experiment is to measure Pt{E). Thus the cluster should not make 
any collisions between the thermalizer and the laser interaction zone which 
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Fig. 11. a) Near a phase transition a small system can show a curious structure in 
the entropy, a dent where the curvature is negative. This “convex intruder” (ar- 
row) is quite small but has far reaching consequences, b) The energy distributions 
P{E) become bimodal near the phase transition Ta. c) For the microcanonical 
caloric curve one has to plot the energy of the extrema of P{E) against the tem- 
perature, which results in “backbending” due to the bimodality. d) The canonical 
caloric curve is a plot of the mean energy [vertical arrows in b)j against the temper- 
ature, which gives always a monotonic curve. Its derivative, f) is always positive, 
while the derivative of the microcanonical curve [e] can become negative. 
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could alter this distribution. Care is taken to ensure this as far as possi- 
ble, e.g. all nets, safe one, are removed from the ion flight path and the 
background pressure is kept sufficiently low. Then the energy of each single 
cluster stays effectively constant after it has left the heat bath, and one 
has a microcanonical ensemble. Averaging over many clusters, one regains 
Pt{E) of equation (5.1). The energy of a single cluster is a sample of this 
distribution. 

5.2 Analysis of the fragmentation process 

The second step of the experiment (see introduction to Chap. 2 and Fig. 4) is 
to measure the energy E, more exactly the distribution Pt{E) of 
equation (5.1). Photofragmentation is used for want of a better method. 
The processes involved will be discussed now. 

5.2.1 Photo-excitation and energy relaxation 

After mass selection the cluster absorbs several photons of energy hv. The 
excimer pumped dye laser has a pulse length of about 10 ns, which is or- 
ders of magnitude longer than electron-electron and electron-vibrational 
relaxation times [37]. The photon induces an excitation of the plasmon 
type [38-41] which decays through a chain of processes according to the 
following scenario: the collective plasmon excitation decays into a single 
electron excitation, which produces through electron-electron collisions a 
hot electron gas. This is turn couples to the vibrations. Only after com- 
plete relaxation, the next photon is absorbed. As the photon energy is 
smaller than the energy needed to eject an electron, no electron ejection is 
seen with a nanosecond laser under low fluence. Only if the time between 
the absorption of two photons becomes of the order of the electronic re- 
laxation time, one observed electron emission. But even under these much 
more violent excitation, no fast emission of atoms can be observed [37]. 

5.2.2 Mapping of the energy on the mass scale 

The original distribution Pt{E) of internal energy is shifted by the absorp- 
tion of n photons of energy hiy to Pt{E — nhu). As the number of photons 
absorbed is not fixed, one obtains for the distribution of internal energy 
f{E) after the laser interaction: 



/(A) =^a,Pr(A-zH, (5.2) 

i 

where the coefficient ai reflect the probability of absorbing i photons. Once 
the energy is sufficiently high, the cluster starts to evaporate atoms [42] on 
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the time scale of this experiment. The more energy a cluster carries before 
photoexcitation, the more atoms it will evaporate afterwards. Thus, the 
mass distribution corresponding to a fixed number of absorbed photons is a 
map of its internal energy distribution U{E) as given by equation (5.2). The 
details of the mapping process are discussed in the caption to Figure 10. 



5.2.3 Broadening of the mass spectra due to the statistics of evaporation 
The evaporation rate kn{E) is approximately given by 

kn{E) = iygexp{-sDn/U{E)} • (5.3) 

Here i' is a typical frequency (10^^ to 10^^ Hz), s = 3n — 6 is the number of 
vibrational degrees of freedom, g is a degeneracy factor usually taken as the 
number of surface atoms, and is the dissociation energy of a cluster of 
n atoms. The time between laser excitation and ejection of the first atom 
is tn ~ kn{E)~^, which decreases dramatically with increasing E. At each 
evaporation step 

Na+(T;) ^ Na + Na+_i(T; - - Am„) + (5.4) 



the energy decreases by essentially the dissociation energy H„. At each step 
of the fragmentation chain 



Na: 



Na: 



kn—l 



Na: 



kn — 2 



Na+_3 



kn—3 



Nal 



.-4 



(5.5) 



the fragmentation rate decreases and the lifetime increases. Even if Pt{E) 
was a (5-function, the mass distribution after the fragmentation has a finite 
width Pevap(A) due to the statistical nature of the photofragmentation pro- 
cess. That is, more than one decay process can occur in the time window 
of the experiment; or in other words, the decay process described by kn-i 
in equation (5.5) will start, before the kn process is completed. 

If the photon energy is sufficiently high (three to four times the energy 
needed to eject one atom from the cluster), the mass spectra Mt(u) show 
an oscillatory structure as function of the number n of atoms evaporated, 
as shown in Figures 5 and 10. The maxima and minima of the oscillations 
are exactly one photon energy apart, which allows an energy calibration of 
the mass scale. Thus one can transform Mr(n) to Mt(A). The envelope 
Mt{E) of the energy-calibrated mass spectrum (see Fig. 10) becomes: 



Mt{E) cx Pt{E) 0 Pevap(A), (5.6) 

i.e. it is given by the original distribution Pt{E) convoluted by the distri- 
bution Pevap(A') due to the evaporation process. 
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There are several other small contributions to the experimentally ob- 
served width: a few percent dimers are emitted at the high temperatures 
attained, the number of collisions with the background gas is small but 
not zero. All these effects can be effectively lumped together to produce 
an effective Pevap- As this distribution is quite symmetric, the maxima of 
Mt{E) and Pt{E) in equation (5.6) coincide for temperatures more than 
10 K above or below the phase transition. 



5.3 Canonical or microcanonical data evaluation 



It was discussed above that the ensemble of clusters has a canonical distri- 
bution of initial energies, but that each single cluster arriving in the laser 
interaction zone corresponds to a microcanonical ensemble. The question 
thus arises, which kind of experiment is performed, a canonical or a mi- 
crocanonical one? In fact neither is performed, but something more funda- 
mental is measured: the distribution Pt{E) of which either a canonical or 
a microcanonical evaluation can be done. 

For a plot of a caloric curve one must know A as a function of the 
temperature. As Pt{E) has a finite width one must have a prescription 
how to evaluate this energy from Pt{E). One has two different possibilities: 
1) if one uses the mean energy 



Acan = J EPT{E)dE^ J PT{E)dE 



(5.7) 



one has a canonical situation [34] . One can show analytically, that in this 
case the canonical caloric curve (Fig. lid) is a monotonously increasing 
function [34,43,44] and the heat capacity (Fig. Ilf) thus always positive. 
2) If one uses the extrema E^-xt of Pe(F) one has a micro- canonical situation. 
This can be seen by differentiating equation (5.1) with respect to the energy. 
The condition for an extremum of Pe{T), 

T = {dS{E)/dE)~^ = Tean, (5.8) 



is identical to the definition of the microcanonical temperature Tcan as the 
inverse of the slope of the entropy [34]. Thus, a plot of E^xt against the 
temperature T of the heat bath is identical to the microcanonical caloric 
curve (Fig. 11c), which is defined as a graph of E against Tcan- 

Except near the phase transition, the caloric curve is quite linear and 
the distributions Pt{E) are thus nearly Gaussians. Maximum and center of 
Pt{E) are almost at the same energy. This conclusion is not altered by the 
statistical evaporation process, as the distribution Pevap of equation (5.6) 
is also nearly symmetric. Thus canonical and microcanonical caloric curves 
obtained do not differ from each other within experimental accuracy. 
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The situation is more complex in the temperature range ±10 K around 
the melting point. The surprising features occurring there are summarized 
below in Section 6.1 and in the caption to Figure 11. The distributions 
Pt{E) are asymmetric (see Fig. 11b, temperatures T 2 to T 4 ), maximum 
and mean are no longer identical, and the two caloric curves differ from 
each other (see Figs. 11c and d). Additionally, neither the maximum nor 
the mean of Pt{E) agree with the maximum of Mt{E) of equation (5.6). 
Experimentally, both caloric curve are smeared out due to the convolution 
with Pevap- For the construction of the caloric curve in Figure 7 the maxima 
of the distribution Mt(A) have been followed; a microcanonical curve is thus 
obtained, averaged by the statistical broadening. 

Summarizing this part one can say that one can either make a canonical 
or a microcanonical evaluation of the data. All caloric curves given here 
and published earlier have been obtained by following the maxima in data 
similar to those in Figure 5. Microcanonical caloric curves are thus obtained, 
averaged over the distribution Pevap(F') of equation (5.6). The averaging 
is only important close to the phase transition, ±10 K away from Tmeit 
canonical and microcanonical curve are the same within experimental error. 



6 Results obtained from a closer look 

6.1 Negative heat capacity 

There exists a surprising theoretical prediction for a small system: its micro- 
canonical heat capacity can (but must not) become negative. An increase of 
energy can - under certain conditions - lead to a lower temperature. Every 
day experience tells us the contrary, if energy is added to a system it will 
get warmer. But negative heat capacities have since long been known in 
astrophysics [43,44], where energy can be added to a star or star cluster 
which then cools down. A similar effect has been calculated for melting 
atomic clusters [4,45] and fragmenting nuclei [35,46]. 

Only the positions of the fragment groups have been used so far for the 
determination of the caloric curves. As discussed elsewhere [47] a nega- 
tive heat capacity cannot be observed in this case. One thus needs more 
information, which can be indeed obtained from the measured fragment 
distributions. 

The entropy S{E) of a small system can exhibit a curious structure near 
a phase transition, a dent with an inverted curvature as shown in Figure 11. 
This structure has been predicted by theory and has been observed in many 
numerical simulations [4-6,35,45]. The inverted curvature of the entropy 
has two interesting consequences 1) The microcanonical caloric curve T^(E) 
gets a negative slope (colloquially called backbending), which means that 




H. Haberland: Cluster Melting 



51 



the corresponding heat capacity becomes negative. 2) The canonical energy 
distribution Pt{E) shows a bimodal structure [4-6,35,45,48]. 

Since Pt{E) is mapped onto the shape of the fragment groups, these 
should become bimodal, too. An observation of the bimodality would there- 
fore be a direct proof of a negative heat capacity. Fragment distributions 
were studied in detail for a cluster for which the effect of a nega- 

tive heat capacity can be expected to be particularly pronounced due to 
its very high latent heat [6, 15, 16]. Unfortunately, the bimodality could 
not be observed directly, as there are broadening mechanisms in the frag- 
mentation process which just smear out these details. One has to apply a 
trick which enhances the modulation of the fragment spectra: for a certain 
photon energy the overlap of adjoining fragment groups produces a pattern 
which allows one an unambiguous decision whether or not the microcanon- 
ical caloric curve shows backbending. 



The details of data evaluation [47] are a bit technical and will not be 
discussed here. One can indeed show that Naj^jy has an entropy with an 
inverted curvature and consequently a negative heat capacity in the energy 
range of the phase transition. A quantitative estimate for this heat capacity 
can be obtained from a least square fit [49] to the data: at the melting 
temperature an increase of the internal energy of Na]^^^ by 1 eV leads to a 
concomitant decrease in temperature by about 10 K. 



How can this negative heat capacity be interpreted? Upon melting, a 
large system converts added energy completely into potential energy, reduc- 
ing continuously the fraction of its solid phase. The kinetic energy and thus 
the temperature remain constant. A small system, on the other hand, tries 
to avoid partly molten states and prefers to convert some of its kinetic into 
potential energy instead. Therefore the cluster can become colder, while its 
total energy increases. 



Negative heat capacities have now been found for melting clusters, frag- 
menting nuclei, and astronomical objects. What do these widely different 
systems have in common? The answer is that in these systems energy is not 
an extensive quantity; i.e. if such a system is divided into arbitrary sub- 
systems, the total energy is not simply the sum over the subsystems. The 
interaction between the subsystems has to be taken into account [34,44,50]. 
For example, in stars it is impossible to neglect the gravity between parts of 
the system [43,44]. Similarly in clusters and nuclei the interaction between 
subsystems is not negligible due to their small size. 
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6.2 Entropy 

The entropy S{E) can be obtained by integrating equation (5.8): 

S{E) = j dE/T{E). (6.1) 

Inserting the inverse of the measured caloric curve T = T{E), one can 
calculate the entropy (up to an additional constant) from the experimental 
data. The result for Na^^jy looks like the one given in Figure 11a. The 
dent is surprisingly small, only about 3 x 10“® eV/K at a total value of 
S{n = 138,T„ieit) ~ 15 eV/K. This method is the only experimental one, 
which allows one to determine entropies for clusters. 

Rewriting equation (6.1) one obtains: 

S{T) = J dTc(T)/r, (6.2) 

where c is the heat capacity defined in equation (2.1) [7]. The measured 
caloric curves are piecewise quite linear and the heat capacity thus constant 
there. This gives a clnT dependence of the entropy and for the density of 
states Et oc T°. As the heat capacity is about proportional to the number of 
degrees of freedom (3n — 6) one sees how dramatically the density of states 
increases with increasing number n of particles in a cluster. 

In principle, one could also say that this experiment measures entropy 
directly, as the energy distribution Pr{E) produced by the source is just 
the exponential of the entropy times the Boltzmann factor (see Eq. (5.1)), 
and this Pt{E) is mapped by the fragmentation process onto the mass 
spectrum. Practically, the convolution integral of equation (5.6) cannot be 
inverted due to numerical noise problems, but outside ±10 K of Tmeit it can 
be solved numerically. 

7 Unsolved problems 

For several interesting problems there is no or only a partial answer today: 

1. how does the caloric curve look at low and very low temperatures? 
A lot of interesting information is expected there. For bulk matter, 
the phonon heat capacity goes as while the electronic one scales 
as T for a metal, becoming the dominant one at sufficiently low tem- 
perature. For finite systems, one has a gap between ground and first 
excited state both in the electronic and vibrational density of states. 
Thus both heat capacities must go exponentially to zero for very low 
temperatures. As the vibrational gap is much smaller than the elec- 
tronic one, the electronic heat capacity will be exponentially small 
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compared to the vibrational one at very low temperatures, even for 
metal clusters; 

2. are there other cluster sizes with negative heat capacity near the melt- 
ing transition? This is expected as many calculations have seen this 
effect; 

3. the experiments have been carried out so far with positively charged 
sodium clusters. What would be the results for neutral, negatively, or 
doubly positively charged clusters? 

4. how can the method discussed here be generalized to other elements 
or even molecules? This will not be a problem with the other alka- 
lines. But for many other elements one runs into a problem: the 
photon energy must be at least three to four times the dissocia- 
tion energy D, otherwise the modulation of the mass spectra seen in 
Figures 5 and 11 becomes too weak, and no caloric curve can hence 
be extracted from them. Dissociation energies of several volts are not 
uncommon, and the necessary powerful lasers are often not available 
at the required high photon energies. Also they will lead to ionization, 
further complicating the analysis; 

5. can one study the caloric curve of the liquid to gas transition for a 
cluster? Experimentally, this is not possible. There seems to be, 
however, a possibility to construct this curve; 

6. can other types of phase transitions be measured, e.g. the magnetic 
one, where de Heer and coworkers have published a beautiful first 
attempt [13]. Unluckily, the ferromagnetic materials have high disso- 
ciation energies, and one runs into the problem discussed in point 4 
above. 

8 Summary and outlook 

The melting of a small finite system in vacuum has been discussed. The 
experiments have been carried out with positively charged sodium clusters, 
but many results should be independent of the chemical nature of the el- 
ement under study and thus be quite general in nature. The main results 
can be summarized as: 

1. the caloric curve of free, mass selected clusters in vacuum has been 
measured. The measuring process can be divided into two parts: 

• sodium cluster ions, Na^, are thermalized, which gives them a 
canonical distribution of internal energies. One cluster size is 
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selected. This prepares a cluster of known temperature T, con- 
taining an exactly known number of constituents; 

• the selected cluster is irradiated with a laser, and the distribu- 
tion of photofragments is measured as a function of the cluster 
temperature. From this the total internal energy E of the cluster 
can be determined. 

Knowing E and T, one can construct the microcanonical caloric curve 
E = E(T), averaged over an experimentally unavoidable broadening; 

2. the melting point, and the latent heat of fusion can be directly read 
off the caloric curve; 

3. the melting points of Na+, 55 < n < 340: 

• are about one third lower than in the bulk; 

• they fluctuate by about ±50 K; 

• the physical origin of fluctuations is not known; 

4. the latent heat of fusion shows similar fluctuations as the melting 
point. 

5. the microcanonical heat capacity is negative near Tmeit for NaJ^^. 
Theory indicates that this should be not an uncommon behaviour for 
other clusters, but the broadening due to the photofragmentation has 
so far precluded its observation, save for this one favourable case; 

6. the entropy of the size selected cluster can be determined, save near 
Tmelt- 

There are several other fields of science where similar phenomena are ob- 
served or applied. The atomic nucleus is a finite system for which phase 
transitions have been studied intensively. Due to its large zero point motion, 
the nucleus does not become solid. But it shows a liquid to gas transition, 
which has interesting similarities to the cluster case [46], i.e. it can have 
also a negative heat capacity. The determination of energy and temperature 
are more involved in the nuclear case. 

Finally, are there any applications of the size dependence of the melt- 
ing point of small particles? This plays an important role in the softening 
and melting of polymers [51]. Even a medical application has recently been 
proposed [52] . A drug could be encapsulated in or bound to a tiny particle, 
whose melting point is adjusted via its size to be just above the body tem- 
perature. By an external warming of a specific part of the body the particles 
could be forced to release the medicine exactly there. One can thus have a 
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high drug concentration in some part and a very small one in another part 
of the body. As all drugs have unwanted side effects, the potential of such 
a technique would be great. 



Evidently, the work presented results from the cooperative effort of a whole team of peo- 
ple. Contributed have (in alphabetical order): Jorn Donges, Thomas Hippier, Bernd von 
Issendorff, Werner Kronmiiller, Robert Kusche, Ralph Schlipper, and Martin Schmidt. 
Financial support came from the Deutsche Forschungsgemeinschaft through SFB 276. 
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EXCITATIONS IN CLUSTERS 



G.F. Bertsch 



1 Introduction 

There are two points of view on cluster physics, which are implicit when 
one asks the questions: 

Do the properties of clusters bridge the physics of atoms to the physics 
of bulk systems? 

Or do clusters form a world of their own, with unique properties seen 
neither in atoms nor in macroscopic systems? 

A positive answer to the first question would give a strong motivation to 
study clusters in order to develop tools for studying large systems. Because 
clusters are finite with discrete electronic excitations, their observed prop- 
erties might presumably be used more effectively to develop better theory. 
On the other hand, a positive answer to the second question is motivation 
to study clusters in their own right. In fact, depending on the properties 
under study, both questions have affirmative answers. 

One of the unique aspects of clusters is their magic numbers, which will 
be discussed in detail in the lectures of Martin and Brack. A sampler of 
abundance spectra showing the magic number phenomena is displayed in 
Figure 1. 

In the top panel are shown the abundances of sodium clusters in the 
pioneering 1984 experiment of Knight et al. [1]. The numbers 8, 20, 40, are 
clearly favored. These numbers can be associated with the group SU(3). 
This is a fancy way of saying that the experiment showed special stability for 
the closed shells in a three-dimensional harmonic oscillator potential, which 
happens to carry the symmetry of the group SU(3). The most important 
point is that spherical shell closures of delocalized electrons convey stability 
to the system and determines structure. In nuclear physics the discovery of 
shells of delocalized particles opened the door to more powerful theoretical 
tools to understand nuclear structure, and the same has happened in metal 
cluster physics. 



© EDP Sciences, Springer- Verlag 2001 




60 



Atomic Clusters and Nanoparticles 




10 20 30 40 50 60 70 80 

N 



Fig. 1. Abundance spectra in cluster beams. The data are from the following 
references: carbon [4]; sodium [1]; xenon [3]. 



Magic numbers of very different origin come from the geometric arrange- 
ments of atoms in a cluster. The Mackey icosahedra [2] are the prime ex- 
ample of geometric packing in finite systems. Let us try to pack hard balls 
into a compact shape. Starting with one at the center, one finds up to 
twelve can be positioned around the central ball and touching it. However, 
the packing is imperfect in that the outer layer of balls is not close packed, 
and contains spaces between at least some of the balls. The space can be 
arranged symmetrically putting the outer balls at the vertices of an icosa- 
hedron. Each face of the icosahedron is defined by a triangle of balls. The 
Mackey construction adds layers by building a larger triangle on each face. 
These icosahedral numbers were seen first in a cluster beam experiment on 
Xe clusters [3], shown in the lower panel of Figure 1. The favored icosahe- 
dral numbers are 13, 55, 147, ... However, one sees in this experiment that 
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Fig. 2. Schematic infrared absorption spectrum of Ceo, from reference [5]. 



the icosahedral numbers hardly stand out from others equally prominent. 
Martin’s lectures will explain how in the basic icosahedral framework other 
numbers can arise as well. 

The middle panel in Figure 1 shows carbon abundances with the famous 
peak at fV = 60 from the fullerene molecule Ceo- In this case, the experi- 
mentalists observed that 60 was magic, and deduced from that the structure 
should be icosahedral, with the carbon atoms positioned like the vertices on 
a soccer ball. In some respect, Ceo is an ideal molecule to bridge the atom 
and the bulk: it is the largest assemblage of atoms possible in which all the 
atoms are equivalent. It also has the largest possible number of point group 
symmetries (120). The history of the synthesis of Ceo is also interesting 
from another point of view. The original discoverers, Kroto, Smalley et al., 
were chemists by background but made use of physical techniques-cluster 
beam apparatus and time-of-flight mass measurement-to first observe the 
molecule [4]. But the practical bulk synthesis was devised by physicists, 
Kratschmer and Huffman [5], using extraction techniques taken directly 
from the chemistry laboratory. Thus one sees in the study of clusters a 
blurring of the distinctions between physics and chemistry. Once macro- 
scopic quantities became available, it was possible to measure properties 
that would be extremely difficult otherwise. An example from the original 
paper is the infrared absorption spectrum, shown in Figure 2. Due the high 
symmetry of the molecule, there are only four optical active transitions, 
although the number of vibrations is much larger, 2iN — 6 = 174. We will 
come back to this spectrum in Section 4. 
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Photon Energy (eV) 

Fig. 3. Surface plasmon in alkali metals. Left, KJqq from reference [7]; middle 
and right NaJg 4 j from reference [8]. 



There are many properties of clusters that can be measured in clus- 
ter beam experiments. The ionization potential can be measured rather 
directly in photoionization experiments from the threshold energy. The 
binding energy of an atom to the cluster is measured indirectly through the 
evaporation rate and its temperature dependence, using statistical theory. 
A recent example of the application of the theory to determine the atom 
separation energy in Na clusters may be found in [6]. 

The response of clusters to external electromagnetic fields is a large 
subject, and will be the main topic of my lectures. An external electric field 
£ induces a dipole moment D\ the linear polarizability a is the coefficient of 
proportionality in the expansion D = a£ -I- ... This is commonly measured 
by deflection of a beam in an inhomogeneous electric held. One can also 
study magnetic clusters by their deflection in an inhomogeneous magnetic 
held, the classic Stern-Gerlach experiment. Magnetic properties of clusters 
will be a topic in Pastor’s lecture. 

The photon absorption cross section is also measured over a frequency 
range starting from optical frequencies to the very far ultraviolet (~10 eV 
photon energy). In simple metal clusters one sees a coherence between the 
electrons in that there is strong peak in the response involving all of the 
valence electrons. This is the surface plasmon mode, and I shall have quite a 
bit to say about it. Figure 3 shows some typical optical absorption spectra 
of metal clusters. In a spherical cluster, the surface plasmon is sharply 
defined, as may be seen in the first panel, showing potassium spectra [7]. 
The plasmon is split in nonspherical clusters, as shown in the second panel. 
This geometric sensitivity of the plasmon may be understood at many levels, 
from classical to quantum mechanical. This feature of cluster behavior is 
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also reminiscent of a similar phenomenon in nuclear physics. I will show how 
an analytic rough description be derived looking at the short-time behavior 
of the wave function. 

The main aim in these lectures is to give the student some familiarity 
with the basic tools the theorist has at his disposal to study these cluster 
properties, particularly electronic excitations. In this respect one should 
emphasize the importance of simple models. For describing large numbers 
of particles, the use of models that focus on a few degrees of freedom is un- 
avoidable. Even if one’s efforts are in large-scale ab initio numerical calcu- 
lations, the models are extremely useful to interpret the numbers and check 
the numerics. My own cluster research, published in the papers [9-31], has 
evolved from applications of simple ideas taken from nuclear physics to nu- 
merically intensive computations of ab initio theory. While I will show some 
of these ab initio results that are won by intensive computer computation, 

1 think one of their main values is to validate simpler models. 

Before discussing electronic excitations, I will go through statistical reac- 
tion theory, which is indispensable to interpret cluster formation and evap- 
oration, and many electron transfer processes. 

2 Statistical reaction theory 

An important tool for understanding dynamic processes in clusters, as well 
as for extracting information about their energetics, is statistical reaction 
theory. Like statistical mechanics, the theory is well-known in principle. 
But much of the literature is confusing, and I think it is worthwhile to as- 
semble all the relevant formulas and their derivations. The generic problem 
is to calculate the emission rate of an excited state of a bound system. The 
derivation of the statistical rate formula is based on detailed balance, which 
is one of the fundamental principles of statistical mechanics. If we divide 
the system’s phase space up into microstates, the equilibrium state has all 
the microstates equally occupied. Looking at the detailed dynamics, the 
probability flux out of each microstate must equal the probability flux in. 
The inward flux is relatively straightforward to compute or to character- 
ize empirically. Then making the equality gives us a way to calculate the 
emission rate. Note that the emission rate does not depend on the system 
being in equilibrium with its surroundings. It only depends on there being 
equilibrium among the internal microstates of the system. 

We sketch a system. Figure 4, with one degree of freedom singled out, 
say the position of the particle that will be emitted. All the other degrees 
of freedom will be treated by imagining the Hamiltonian to be diagonalized, 
and so represent them by states. In the scattering problem these states with 
a fixed radial coordinate for the particle are called channels. We now put the 
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Fig. 4. Separation of internal and external states for the derivation of the statis- 
tical decay formula. 



system in a box to make it easier to count states for the detailed balance 
argument. The thick curve in Figure 4 shows the energy of the lowest 
channel as a function of distance. The states of the system, separated into 
internal and external states, are shown with light lines. These are not the 
eigenstates of the system, because we have not coupled the internal to the 
external states. Let us now count the states and invoke detailed balance. 
For the external states, let us give the box a radius R. We will count the 
number of states in an energy interval AA, and we need the density of 
external states with respect to energy. 

This is calculated as follows. For a large box, we can take the normalized 
asymptotic wave function to have the form^ 




^This ignores the scattering phase shift and the angular-momentum-dependent phase 
of the asymptotic wave function. 
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where n labels the different states. The density of the external levels dE/dn 
is given by^ 

dn Rm 
dE h^irk 

Let us now find the inward current in an energy interval AE. The asymp- 
totic external wave function has both incoming and outgoing components, 
with the incoming component given by 



(j) = 



V ~R 2 



The current associated with this component is 



(j) 



Uk 2 
4to R 



When the particle reaches the cluster, it may be reflected immediately or 
it may be transmitted into the interior. Define the transmission coefficient 
Tc, with 0 < Tc < 1. Then the flux that goes into the target is the current 
of each state times the number of states times the occupation probability 
of each state. Calling the last quantity /o, the inward flux is 

Ht? 'T' 

AE = fo^AE. 



By detailed balance we can equate this to the outward flux, assuming that 
the internal states have the same occupation probability /q. Let’s define an 
average decay rate of an inner state, W. The total fiux out is given by 



JoWpeAE 



where pe is the density of internal states of the system. Equating the last 
two expressions gives the statistical formula for the decay rate W, 



W = 



EcTc 

2TrhpE 



( 2 . 1 ) 



Note that the derivation does not require the system to be in equilibrium 
with its surrounding, only that there is an internal equilibrium among the 
internal states. 

To use the formula, one needs to know the transmission coefficients as 
well as the level density of the internal states. In many applications the 



^This may be derived from the boundary condition that the wave function vanish at 
the surface of the box^, = nn, taking the asymptotic energy, E = Ti^k'^/2m. 
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channel potential energy has a barrier, and one can make the approximation 
that the transmission coefficients are zero below the barrier and unity above. 
Then there is a correspondence between the channels and the level density of 
the daughter system. With these assumptions, one obtains the well-known 
transition state theory, called RRKM theory in chemistry. In principle the 
channels are discrete, and at energies close to the barrier energy the decay 
rate might show a step-like behavior. In fact, this was predicted in nuclear 
physics in the original fission theory of Bohr and Wheeler, which made use of 
equation (2.1). However, the discreteness of channels was never clearly seen 
in nuclear fission, because the quantum mechanical transmission coefficients 
increase smoothly as the channel opens. But in my last lecture I will show 
you an example from carbon structures that perfectly shows the individual 
channels. 

The statistical decay formula also requires the density of states of the 
decaying system, pE- Under ideal conditions when the internal states do 
not overlap, {W < {Tipe)~^), the internal states can be explicitly counted. 
This is the case in nuclear physics for energies close to the neutron emis- 
sion barrier. For atomic clusters and molecules larger than several atoms, 
the vibrational density of states is huge compared to the electronic, and 
there is no hope to see individual internal states in the regime of electronic 
excitations. 



2.1 Cluster evaporation rates 

We now apply the formula to evaporation of atoms from a cluster. We 
consider a cluster having excitation energy E*, emitting an atom of kinetic 
energy Ek and leaving the daughter cluster at excitation energy Ei. The 
energies are related by 



E* = E, + Ek + D (2.2) 

where D is the binding energy of the atom to the cluster in the ground state. 
In general, the atom can be emitted in many angular momentum states, and 
each one will have its own channel. However, in the end the incoming flux is 
determined by the inverse reaction cross section. To derive a simple formula, 
let us assume that the atoms are spinless and that the specific state of the 
cluster play no role in the absorption cross section. Then the transmissions 
coefficients depend only on the orbital angular momentum I and the energy 
of the atom, and the reaction cross section is given by 

<^r=^J2^2l+im{EK) 
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where /2m = Ek- Exactly the same sum will appear when one makes 
an approximate sum over channels in equation (2.1). The channel are distin- 
guished by the internal state of the daughter cluster (including its angular 
momentum L), the orbital angular momentum of the evaporated atom I, 
and the total angular momentum of the system J. For a given initial state 
the total angular momentum is fixed, so the channel sum will involve sums 
over I and L. The conditions of evaporation are generally such that the 
maximum I is much less than the typical J. Then the channel sum over L 
can be enumerating with the familiar rules of angular momentum coupling, 
L = J — l,J — l+l , ..., J + 1, giving 2^ -I- 1 terms altogether. If the particle 
has spin with g states, the spin coupling increases the number of channels 
to g(2? -I- 1). Then assuming that the internal states of the daughter are 
independent of L, the formula (2.1) can be expressed 



, gE.E,( 2 i + l)T, 

2ttTlpe 

Here the i sum is over internal states of the daughter for some typical angular 
momentum L. The transmission coefficient depends on the i only because 
the energy of the evaporated particle depends on how much energy is left in 
the daughter. We now replace the sum over I by the cross section formula, 
to get 



Tx, _ 9Y.rk'icf,{EK) _ gmJ2^EKCFr{EK) . 

27T^npE E^n^pE ' ^ ’ 

Here the evaporated particles momentum k and kinetic energy Ek depend 
on i through equation (2.2). The differential decay rate with respect to Ek 
is easily obtained by the replacing the sum over internal states by their level 
density, dSj/dE ^ pE- The formula is. 



dW _ gmpN-i{E* - D - EK)EK<Jr{EK) 
dEK TT^h^ pn{E*) 



Here we have changed notation on the level density p to distinguish the 
parent density pN and the daughter density pn-i by a subscript. This 
formula was first derived by Weisskopf to describe neutron decay of excited 
nuclei [32]. 

Further reduction of this rate formula requires some specific information 
about the level densities. One simple limit is to assume that the excitation 
energy and level densities are high enough that the level density of the 
daughter system behaves exponentially over the range of the interest in the 
decay. This permits a parameterization with a temperature: pn{E* — D — 
Ek) ~ Pn{E* — D) exp(— Eif/T), and equation (2.5) can be integrated with 
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respect to Ek- If we further assume that the cross section is constant, the 
integral is elementary and the total transition rate becomes^ 

^ _ mT‘^pN-i{E* - D)ar 
TT^h^PNiE*) 



We now specialize to level densities associated with vibrations. If the vi- 
brations are harmonic, the level density can be expressed completely ana- 
lytically by Kassel’s formula [33], 



Pn{E) 



Es-i 



( 2 . 6 ) 



Here s is the number of vibrational degrees of freedom, with s = 37V — 6 for 
triatomic and larger clusters. The sum goes over the vibrations j with their 
frequencies uij. Since the formula will be derived in Wales’ lectures, I need 
not discuss its validity here. 

Again assuming that the reaction cross section is constant, we can inte- 
grate over the density of states of the daughter system to get the following 
formula 



W = — (s — l)TO(Jr 
7T^ 



{E* - DY~^ 

{E*y-'^ 



(2.7) 



Here is the ratio of the products of vibrational frequencies for parent and 
daughter. The decay rate in essentially this form was originally derived by 
Engelking [34] . Note that the quantum of action U has dropped out of the 
formula. Once one takes Kassel’s density of states, the remaining physics is 
completely classical. 

For midsize and larger clusters there are many vibrational degrees of 
freedom, s 1. It also usually the case that the evaporation is observed in a 
regime with E* ^ D. Then the conditions for an exponential approximation 
are satisfied, and we can write {E* — DY~‘^ / {E*Y~‘^ ~ exp(— D/T) where 
T is defined T = E*/{s — 2). Note that the relation between T and E* is 
exactly what one obtains classically for the relation between temperature 
and energy in a system of s — 2 oscillators. The decay rate formula then 
becomes 



W = 



oj^mar -d/t 
7t2T ® 



( 2 . 8 ) 



The most important feature of this formula is the extreme dependence of 
the evaporation rate on D. Let us take the example of sodium cluster of 



®We have also assumed that the spin of the particle can be neglected (g = 1). 
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size N « 100. The binding energy of an atom is about D « 1 eV, and we 
will approximation w by the Debye frequency, Tioj « 0.01 eV. Let us also 
assume that T in the range of values near room temperature, T = 0.025 eV, 
to evaluate the prefactor in the formula. Although equation (2.8) does not 
have Ti in it, it is convenient to express frequencies as equivalent energies. 
Two more needed dimensional constants: fi = 0.658 eV-fs is needed to get 
a decay rate in units of s“^. The mass is conveniently expressed in terms 
of f? juiYi = 4.15 X 10“^ eV-A^. The inverse cross section is estimated as 
(Jr = « 300 A, that is, assuming that all atoms that hit the cluster will 

stick. Putting the numbers together, we find 

w ~ (0.01)^(23)(300) D/T ^ iq16 -D/T 
^ 7t2(0.00415)(0.025) 

Cluster beams have flight times of the order of tens to hundreds of mi- 
croseconds. Thus a measurable evaporation rate requires the exponential 
suppression factor to be in the range exp{—D/T) « 10“^^, i.e. D « 25T. 
We can see from this that a 10% change in D will change W by an order of 
magnitude. For T = 0.03 eV, the excitation energy is E* = 9 eV, and the 
evaporation leaves the daughter clusters at the energy if « 8 eV. Thus, the 
chain of evaporations has very different rates for each stage, and the multi- 
ple evaporation spectrum will be very much peaked around the number that 
corresponds to a lifetime equal to the travel time in the cluster beam. As 
you will see, Haberland will use this property to accurately measure energy 
differences between clusters at different temperatures. 

2.2 Electron emission 

The statistical theory of electron emission is no different in principle, but 
there can be important differences in the details. Because of the Coulomb 
interaction, the geometric area may no longer be a good approximation to 
the reaction cross section. 

Haberland will discuss in his lectures measurements of the electron emis- 
sion in sodium clusters. He will show a very interesting result that the emis- 
sion takes place when the electrons have equilibrated among themselves but 
before the equilibration with the vibrations has taken place. 

Under these conditions, the state densities in equation (2.5) should be 
evaluated for the electron degrees of freedom only. The Fermion character 
of the electrons has two consequences. The two spin degrees of freedom 
gives rise to an additional factor of 2 in the formula, because there are twice 
as many external electron states when detailed balance is applied. Also, 
the level densities are quite different for Fermions. If we assume that the 
particles behave as a free Fermi gas, the level density has an exponential 
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dependence on excitation energy given by Bethe’s formula, 

PE ~ exp (2^tt^NE/2Ey) , 

where Ep is the Fermi energy of the electron gas. It is interesting to note 
that in the Fermion case the quantum effects remain visible in the final 
formula as the factors of fi no longer cancel. 

A final difference is that the inverse cross section cannot be considered 
constant, due to the long-range Coulomb interaction between the electron 
and the (charged) daughter system. The semiclassical formula for the cross 
section when there is complete absorption at a radius R in the present of a 
potential field V is 



Gah = ttR^ 





Note that this diverges at low energy when the Coulomb potential is attrac- 
tive. The divergence is only apparent, because the cross section is multiplied 
by E in the rate formula. This theory of charged particle emission is well 
known in nuclear theory; it was first applied to clusters in reference [35]. 



2.3 Radiative cooling 

The last process I will discuss is the statistical emission of a photon. To 
observe radiative cooling in isolated clusters, one needs longer residence 
times than is provided by the usual molecular beam apparatus. Observa- 
tions have been reported using ion traps [36] and storage rings [37]. In these 
experiments, the photons were not observed directly, but rather the effect 
of the cooling on other processes was seen. Also, it is interesting to note 
that cluster radiation was proposed as a way to achieve more efficient ther- 
mal illumination [38]. We will see that in the spectrum infrared emission 
is suppressed compared to black body radiation. In the derivation of the 
photon decay rate, only the transmission factor in the dipole channel {I = 1) 
is significant, but the formula looks very similar to equation (2.3). Photons 
have two polarization states, making g = 2. Also the energy of the photon 
ifph and the reduced wave number k are related hy k = Ep\^/Tic, so the final 
result for The result for the radiation spectrum is 

- gph) 

difph ^2 ■' pe{E*) ^ ’ 

This formula is also well-known in nuclear physics where radiative decays 
of hot nuclei have been studied for some time. In the cluster context, the 
formula was first derived in reference [39] . 
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3 Optical properties of small particles 



In the remainder of my lectures I will mainly discuss the theory of electronic 
excitations, but before getting into the equations to be solved, I will list for 
future reference a number of useful formulas for describing the electromag- 
netic properties of small particles. Here by small, I mean the wavelength 
is much larger than the size of the particle, X ^ R. Then the electric 
dipole field dominates the interaction, and the optical response can be de- 
scribed with the dynamic polarizability a{Lu). For a spherical system, the 
polarizability is defined 






1 



1 



—Lu — iri + LUi u + ir] + Ui 



(3.1) 



with i labeling excited states. For nonspherical systems, one can define 
polarizabilities for the principle axes and construct a polarizability tensor. 
The photon absorption cross section is related to the polarizability by 

47T0J , , , , 

(Tab = hna[uj). (3.2) 

c 

An important property of the dipole response is the Thomas-Reiche-Kuhn 
sum rule. It may be derived from the operator commutator relation, 
[z, [H,z]/2 = /2me- In terms of dipole matrix elements between ground 

and excited states i, the sum rule reads 

Y^\{0\z\b)\^hu;,= ^N, (3.3) 



where N is the number of electrons. Conventionally one defines a dimen- 
sionless oscillator strength fi for a transition as 

‘^771 

f. = -^mz\i)\Hu;.. 

Then the sum rule is simply fi = N ■ In terms of the dynamic polariz- 
ability, an oscillator strength function can be defined 

5'/(w) = ^^^Ima(u;) (3.4) 



and the /-sum is 



J dhujSf{uj) = /. 



(3.5) 
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Another handy formula is the expression for / in terms of the integral of 
the cross section over the photon energy, 

f ‘ 2^ J 

A practical formula used by chemists takes for the integrand the molar 
extinction coefficient rather that the cross section; a derivation may be 
found in my book [18]. 



3.1 Connections to the bulk 

With clusters viewed as a bridge between the atom and the bulk, it is in- 
teresting to see how the electronic response connects to the infinite medium 
response. There the response is characterized by the dielectric function e, 
which gives the relation between external and internal electric fields at a 
perpendicular interface: e = S-mt/Cext- There are two connections one can 
make to the finite system polarizability a. The first is the dielectric func- 
tion for an cubic crystal composed of polarizable particles, given by the 
Clausius-Mossotti relation, 

_ 1 -I- 87rcmo/3 _ 1 -I- 2a/r^ 

1 — 47rano/3 1 — a/rj] 

where no is the number density of the particles in the crystal. For con- 
venience in the second equation the formula is expressed in terms of the 
Wigner-Seitz radius tq. This is the radius of a sphere whose volume corre- 
sponds to the volume per particle in the medium, 47rrg/3 = l/ng. A similar 
length Ts is in common use to specify the electron density Ug. This is de- 
fined 47rrg/3 = l/ug and is quoted in atomic units (lengths in Bohr radii 
(0.529 A); energies in Hartrees (27.2 eV). 

The other connection is the polarizability of a cluster considered as a 
dielectric sphere. It is a textbook exercise in electrostatics to show that the 
polarizability of a dielectric sphere of radius i? is given by 

a = (3.8) 

We shall call this the Mie theory of the polarizability. One more handy 
formula is the dielectric function of a simple conductor. In the Drude model 
(independent electrons subject to a frictional force depending linearly on 
velocity) the dielectric function is 
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where 7 is the frictional rate constant and Wp is the plasmon frequency, 



ui 



2 

p 



dTre^rie 

TOe 



(3.10) 



In atomic units the formula is simply = S/r^. 

Let us try this in equation (3.8). After some trivial algebra one finds 



a(w) = 






+ uj. 



;/3 



(3.11) 



There are two obvious things to notice about this formula. The first is 
that the zero- frequency limit gives a = R^, which is just the formula for 
the polarizability of a conducting sphere. The second is that there is a 
resonance where the denominator vanishes, near the frequency 



Us = 





(3.12) 



This is the surface plasmon, and we shall call equation (3.12) the surface 
plasmon formula. In atomic units it is simply Us = Ts 



3.2 Linear response and short-time behavior 

The photon absorption cross section and the polarizability are properties 
that only require the dynamics in weak external fields, the linear response 
region. This allows a variety of methods to be applied to solve the time- 
dependent equations, and each method has its adherents. There are two 
very popular methods that I will not describe in any detail in these written 
notes, but they need to be mentioned for completeness. In these methods 
one makes a matrix representation of the problem and either inverts or 
diagonalizes the matrix. Both methods start by Fourier transforming the 
Kohn-Sham equations in time, to obtain equations in the frequency domain. 
In the linear response region, different frequency are not coupled together 
except that the amplitude for a given frequency is coupled to that of its 
negative. 

The direct solution of the time-dependent equations is possible, and 
I have pursued this method with my collaborator, Kazuhiro Yabana, using 
an algorithm originally developed for nuclear physics [40] . With the time- 
dependent equations, one can apply an external field of arbitrary shape and 
strength. If the field is weak, the response of a given frequency is obtained 
from the amplitude of that Fourier component in the excitation field. It 
is very convenient to take the perturbing field to be a (5-function in time. 
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because it has all Fourier components with equal weight. Let us first see 
how this works with the one-particle Schrodinger equation, 



H4> = 




U{r)(j) 




Here U{r) is a fixed internal potential the system. In this section I have 
dropped factors of fi to simplify the formulas. I start with the ground state 
4>o exp(-iEot) and add to the Hamiltonian an impulsive external potential, 
Ve^t{r,t) = Vo{r)S{t). The differential equation then requires that there be 
a corresponding jump in the wave function at t = 0+, 



= (1 - *^o) 00- 



The subsequent evolution can be found by solving the equation as a power 
series in time. The form of the power series is 

= (1 - iVo - [H, Vo]t + i[H, [H, Vo]]f/2 + ...) (3.13) 



The low-order terms in this expansion have direct physical interpretation 
in classical Newtonian mechanics. The initial perturbation of the wave 
function gives it the same momentum density that one would calculate from 
an impulsive force field VVjjxt acting on a density distribution of classical 
particles. This may be seen from the expansion of the expression for the 
momentum density to linear order in the external field. 



p{r) = (p*{0+) 



V - 



2i 



7^ j 



-Wo|((.o|". 



(3.14) 



Here the right hand side is just the momentum density associated with the 
force field VVj,xt and the ordinary density no = Next let us look 

at how the density varies in time. To first order in t the time-dependent 
density is given by 

n{t) = |(()(t)P = |())oP - 2(j)g[H,Vo]4'ot+ ■■■ = no - —noVVot+... 

m 

Taking the derivative of this expression to get dn/dt, and comparing with 
equation (3.14), one can recognize the equation of continuity 

dn ^ V 

— = -V • p[r)/m = -V • j 

where j(r) = p{r)/m is the particle current. The next term in the power 
series will give the initial change in momentum due to the internal forces 
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in the system. It is rather complicated to work out the double commutator 
for the next term in general (but see [9]), so I will specialize to the very 
importance case of the dipole field, Vq = Xz. Here A is a small parameter 
to remind ourselves that the response is to be calculated to linear order in 
the external field. The relevant commutators are 



[H,Vo] 

[H, [H,Eo]] 



m dz 
m dz 



(3.15) 



Inserting in equation (3.13), the wave function evolves as 




— iXz + i 



.Xe dU 



2m dz 






A ddo. 


m oz 



—iEot 



The generalization of this expansion to a many-particle Hamiltonian with in- 
teractions between the particles is very similar. Let us consider 
the Hamiltonian H = ^■p‘^l2m + ~ C) 

the ground state wave function dro. Then the power series expansion of the 
wave function is 




I 



Note that the last term in parentheses only contains the one-particle po- 
tential U. The particle-particle interaction does not contribute because the 
relevant commutator vanishes. 






= 0 . 



I next write down the power series expansion of the dipole moment and 
the total momentum. With a little bit of algebra, these quantities can be 
expressed as follows 



i?(t)^(vI/(t)|z|vI/(t)) = AAt+... 

m 



(3.16) 



and 




(p) ^ (vl/(t)|p,|vl/(t)) 



m 



(3.17) 
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It is easy to connect this time-dependent analysis with the dynamic polar- 
izability function considered at the begin of the section. Formally, we can 
expand the time-dependent wave function following the impulsive dipole 
field in the eigenfunctions, 

i 

The dipole moment and the total momentum then have the following time 
dependencies. 



m 



(vl/(t)|^|vl/(t)) 



A^2 (0|z|*) 



sin Wit 



A^ 2(0|pz|z)(t|z|0) cosojit. 
i 



(3.18) 

(3.19) 



In a power series expansion, the first term of the dipole moment is 



2wi(0|z|z)^t. 

/ 



Comparing with equation (3.16), we see that they are equal if the TRK sum 
rule is satisfied. Thus the sum rule is just a statement about the short-time 
behavior of the wave function in an impulsive external field. 



3.3 Collective excitations 

It is possible to derive simple formulas for collective excitations just using 
the dynamics that was derived in the last section. From equation (3.19) we 
see that the power series expansion of the momentum has the first terms 






We can use this to estimate the frequency of the excitation, if the system 
is very collective. That assumption would allow us to replace the sum over 
states in the above equation by a single state with an excitation energy 
I will call (jJc- Comparing with equation (3.17), we find 



w 



2 

c 




(3.20) 



This formula can be used to make simple estimates of collective frequencies. 
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Let us first consider the spherical jellium model. The background charge 
density is noQ{R — r) and the corresponding potential is 



Ane'^noR^ 

3r 

dTre^noi?^ /3 \ 

3 \2 ~ 2^ ) ’ 



r > R 
r < R. 



If the electrons are contained within the jellium sphere, the integral in 
equation (3.20) reduces to /J^d^rn(r) = N and we immediately get the 
surface plasmon formula, equation (3.12). We thus see a condition of va- 
lidity of the formula: the electrons must be contained within the jellium 
sphere radius. In fact the electron wave functions will spill out, decreasing 
the integral and thereby the frequency of the collective oscillation. 

A simple extension of the model allows us to derive a formula for the 
splitting of the surface plasmon frequency in deformed systems. Let us 
suppose we have a small quadrupolar deformation with the radius vector 
depending on direction according to 



R{ 6 ) = R{1 + (3P2{cose)) 



where P 2 is the Legendre function and /? is a dimensionless deformation 
parameter. Taking [3 small, the charge density distortion of the jellium 
background is approximately 

5n = PRnoP 2 {cos 6 )S{r — R). 



The Coulomb potential associated with this charge density be obtained from 
the multipole expansion. The result for r < i? is 






The mode in the x or y direction will now have a different frequency from 
the 2 mode because the respective second derivatives of the field U + SU 
differ. Working this out, we get to leading order in j3 



c,: = ojy = u;s (1 + 3/3/5) 
w/ = Ws (1 - 6/3/5) . 



4 Calculating the electron wave function 

From a computational point of view, it is extremely difficult to calculate 
many-particle wave functions for more than a few particles, because the 
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Hartree-Fock 




Fig. 5. Hierarchy of quantum many-body theories. 



number of functional values needed in a numerical representation increases 
exponentially with the number of independent variables. On the other 
hand, the calculation of single-electron wave functions in an arbitrary three- 
dimensional potential is within the scope of desktop computers. Fortunately, 
there is a theory that is quite accurate although it only involves the solving 
the one-particle Schroedinger equation. 

The most sophisticated version of the theory is called the time-dependent 
density functional theory or TDDFT for short. I will show a number of 
calculations done with TDDFT, but before that it is useful to review the 
static theory. This is the density functional theory (DFT). For systems 
larger than a few atoms, it has revolutionized the computation of electronic 
properties, as indeed was recognized in the 1998 Nobel Prize. While it has 
been spectacularly successful in calculating static properties, it does have 
well-known deficiencies in treating excitations. It is therefore important to 
understand the approximations and their validity. 

The kinds of theory one can make for electronic structure are shown 
in Figure 5. In my view, Hartree-Fock theory is fundamental to all proven 
approaches. The theory is very simple to describe: minimize the expectation 
of the Hamiltonian in the space of Slater determinants. You have seen the 
basic objects many times before: 



'i’ = AY\_4>i{ri), {(pi\(l)j) = 






2m 

I 

^ 3 .3 ,ne(r)ne(r') 



d^rd^ 



r — r 




(4.1) 



(4.2) 



where ne(r) = is the electron density. The variation 

6E/6(f>*{r) = 0 gives the Hartree-Fock equations. The single-particle energy 
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Table 1. Atomization energies. 





Li2 


C 2 H 2 


20 simple molecules 
(mean absolute error) 


Experimental 


1.04 eV 


17.6 eV 


- 


Theoretical errors: 
Hartree-Fock 


-0.94 


-4.9 


3.1 


LDA 


-0.05 


2.4 


1.4 


GCA 


-0.2 


0.4 


0.35 


T 


-0.05 


-0.2 


0.13 



in the Hartree-Fock equations arises as a Lagrange multiplier to preserve 
the norm of the wave function. While Hartree-Fock is conceptually sim- 
ple, on a practical level it is not accurate enough to be useful for chemistry 
or for computing cluster structures. The level of accuracy for several sim- 
ple molecules is shown in Table 1, taken from Perdew [41,42]. The mean 
absolute error in the atomization energies (energy difference between the 
molecule and the individual atoms in isolation) is 3 eV in the Hartree-Fock 
theory. The predicted binding of the Li 2 clusters is a factor ten too low, and 
another alkali metal cluster not in the table, Na 2 , is incorrectly predicted 
to be unbound. 

The three lines in Figure 5 going down from Hartree-Fock give three 
different approaches to improve the theory. The “Cl”, configuration inter- 
action expansion, uses Hartree-Fock as a basis of many-body wave functions. 
This is a very systematic approach, and it gets marvelous results in small 
systems, as you will see in the seminar of Bonacic-Koutecky. But the num- 
ber of terms needed in the Cl expansion to achieve a given accuracy grows 
exponentially with the number of electrons. So one turns to other methods 
to deal with larger systems. 

Many-body perturbation theory was developed in the 1950’s to make 
possible systematic calculation of the energy of quantum systems with an 
infinite number of particles. One avoids the many-body wave function, 
but the price one pays is to deal with Creen’s functions that depend on 
a few more variables. In condensed matter physics, the most refined ap- 
proximation that can be presently computed this way is called the “GW” 
approximation. You can find a review of it by Hedin [43]; I will not discuss 
it here in any detail. 

The last approach to make a better theory is to keep the form of the 
Hartree-Fock equations, but improve the energy function that is put into 
the variational principle. The DFT will thus include effects of correlations 
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by changing the Hamiltonian. In practice, there is another aspect of DFT as 
it is normally applied, the local density approximation (LDA). Evaluation 
of the exchange interaction is computationally burdensome, and approxi- 
mations to make it look like an ordinary potential have been used since the 
time of Slater. In defining the effective exchange potential, one is guided by 
the energies in the uniform electron gas. The exchange energy of an electron 
of momentum k is [44] 



E 



i 






e^k-p 

7T 




k + kp \ 

kp-k J ’ 



(4.3) 



where kp is the Fermi momentum. The total exchange energy (per electron) 
Ex is given by 



^x = -E\*j 



i<j 



r — r 



J* 



= ^e^kp = 

4tt 47t 



where in the last equation we used the relation between electron density 
Tie and Fermi momentum, Ue = kp/Sir'^. Slater proposed making a local 
density approximation by using a two-body contact interaction that would 
have the same total energy. The one-body potential in the Schrodinger 
equation would then be 

Csiater(r) = - (STT^Ue (r)) . 

Z7T 

THIS IS WRONG. Going back to the variational principle, one sees that 
the one-body potential should be defined by the variation of Ex, 

K(r) = — (37T^ne(r))^/^. (4.4) 

On IT 

This is a factor 2/3 different from Slater’s potential. 

Kohn and Sham proposed to include the correlation energy of the elec- 
tron gas in exactly the same way, determining an exchange-correlation po- 
tential V^c from the exchange-correlation energy E^c of the interacting Fermi 
gas. This is usually done through a numerical parameterization, giving us 
the “LDA” of density functional theory. It gives a considerable improve- 
ment over Hartree-Fock, as may be seen by the entries in Table 1. However, 
the accuracy is still not enough for chemical modeling. An obvious problem 
of the LDA is that the single-particle potential does not have the correct 
asymptotic behavior. The electron potential in a neutral cluster should be- 
have as — e^/r for large separation of the electron from the cluster. But in 
the LDA the Goulomb potential is calculated with all the electrons and thus 
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Table 2. Atomic properties of the Ag atom in LDA. 





IP 


First excitation 


Experimental 


7.75 


3.74 


Kohn-Sham: 






eigenvalues 


£s = 4.6 


Ae = 3.9 


total energies 


8.0 


4.1 



vanishes outside the cluster. This makes the LDA unreliable for calculating 
ionization potentials from the Kohn-Sham eigenvalues. 

As an example of the last problem, Table 2 shows the ionization potential 
(IP) of the silver atom and the excitation energy of its first excited state. 
According to Hartree-Fock theory, the Ag atom has a single electron in 
a s orbital, with an unoccupied p orbital just above and a fully occupied 
d orbital just below. Thus the IP should correspond to the energy of the 
s-orbital. For the silver atom, the Kohn-Sham eigenvalue is off by 40% from 
the experimental ionization potential. Nevertheless the LDA can give quite 
accurate ionization potentials if they are calculated a different way, namely 
as a difference of total energies, 

IP = E{Ag) - E{Ag+). (4.5) 

The table shows that the error is only 3% when the IP is calculated this 
way. For the excitation energy, which is an s ^ p transition, the difference 
of Kohn-Sham energies Ae = 6p — €s is fairly good. However, there are 
caveats that we will come back to later. 

Becke [45] proposed a fix to get the — e^/r asymptotic potential by 
adding a term to the energy functional that depends on the gradient of 
the density, Vne(r). His proposed form works amazingly well. This is the 
“generalized gradient approximation”, GGA. From Table 1, we see that 
energies can be calculated to an accuracy of tenths of an eV. Further im- 
provements may be possible. The Kohn-Sham energy functional depends 
on the nonlocal quantity 

i 

in the kinetic energy term. One could think of using other functional de- 
pendencies on t; an example is given in the last line of Table 1 . 

Before going on to electronic excitations, I want to show how well the 
LDA works for describing the infrared active vibrations in Geo- One first 
calculates the LDA potential energy surface as a function of displacement 
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Table 3. Excitation energies of Ceo infrared-active vibrations (Ti„). 



Mode 


1 


2 


3 


4 


Experimental 


0.065 eV 


0.071 


0.147 


0.177 


LDA error 
Ref. [22] 


-2% 


-2% 


+2% 


-bl0% 


Ref. [46] 


-3% 


-7% 


-8% 


-7% 



of the atoms from their ground state positions. Diagonalizing the Hessian 
matrix then gives the normal modes and their frequencies. The results of 
three different LDA calculations are shown in Table 3. The agreement with 
experiment is impressive, with mean absolute relative error on frequency 
only 4%. A more demanding test of the theory is the transition strength 
associated with the vibrations. The accuracy here is perhaps only a factor 
of two [22]. But that is a great improvement over previous theories that 
were completely unreliable. 



4.1 Time-dependent density functional theory 

Schrodinger proposed two equations in his original paper, the eigenvalue 
equation for static properties and the time-dependent equation for the dy- 
namics. But the left-hand side of both equations was the same. The situa- 
tion is the same for dynamic theories based on Hartree-Fock or DFT. The 
theories may be derived from the time-dependent variational principle. 



,5 



dt 





(4.6) 



Taking 41 to be a Slater determinant, and varying with respect to (j>* (r), one 
obtains time-dependent Hartree-Fock equations, first proposed by Dirac in 
1930. The corresponding equations for DFT are the time-dependent Kohn- 
Sham equations. 



+ ^ Vion{r - Ri) j (pi{r, t) = t). (4.7) 

ions / 



The first application of TDDFT was to describe the photoionization of 
atoms [47]. The theory has since been widely applied to clusters, molecules. 
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Fig. 6. Band gap in insulators and semiconductors, theoretical versus experimen- 
tal for the GW approximation and for the DFT. From Hedin [43]. 



and bulk matter. There are several quite different methods to solve the 
equations, which I will not discuss in detail. 

Before describing the applications to clusters, I want to give a cautionary 
remark. It is tempting to apply DFT to excitations by simply replacing the 
ground state single-particle wave functions by excited state orbitals in the 
Kohn-Sham equation. This seems to work well for small systems, as for 
example in the Ag atom discussed earlier, and in small carbon structures 
that will be discussed later. However, the theory can give poor results 
in large systems. This is the “band gap” problem. This may be seen in 
Figure 6 showing the band gaps between occupied and unoccupied bands in 
some insulators and semiconductors. The DFT, shown with black diamonds, 
is consistently low, by as much as 5 eV in the LiF crystal. 

The band-gap problem originates in the LDA treatment of exchange. 
From equation (4.3) one sees that the exchange potential of an electron in a 
Fermi gas has a weak logarithmic singularity at the Fermi surface. Particle- 
hole excitations across the Fermi surface have a higher energy for a given 
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momentum difference than the quadratic kinetic energy functional. Because 
the singularity is weak, it does not show up for small systems. Clearly, any 
local approximation will miss the singularity. An effective, but computa- 
tionally costly method to overcome this is to calculate the electron energies 
from the many-body perturbation theory. In the GW approximation, the 
electron-self energy is calculated including exact exchange. The GW the- 
ory gives an enormous improvement to the band gap, as may be seen from 
Figure 6. But the computational demands of the theory has so far restricted 
its application to relatively simple systems. 

5 Linear response of simple metal clusters 

In this section I want to present some results for metal clusters. The most 
simple are the alkali metals, having only a single valence electron in the 
s-atomic orbitals. The jellium model, introduced in Section 3 and to be dis- 
cussed in detail by Manninen, describes the qualitative properties of sodium 
clusters very well. The metals in the IB column of the periodic table (the 
“coinage” metals Cu, Ag, Au) also have a single valence electron in the 
s-orbital, but in these metals there is a filled d-shell just below the valence 
shell, which affects the properties quite strongly. We shall first discuss the 
alkalis and then see the differences with the IB metals. 

5.1 Alkali metal clusters 

At the crudest level, the optical response of the metals will show a surface 
plasmon at a frequency given by the surface plasmon formula 
equation (3.12). If we take the density rig at the bulk value for sodium, 
the formula gives 



LUs « 3.4 eV. 

In fact, the observed surface plasmon is at a lower frequency (“red-shifted”). 
For example, the optical absorption spectrum of the cluster NaJ,^ in Figure 3 
shows the resonance at a frequency of 2.6 eV. One possibility to explain the 
red shift would be that the density of atoms in the cluster is less than in the 
bulk. This idea is not born out in DFT calculations of the cluster structure, 
so I will not discuss this possibility any further. Another mechanism to ex- 
plain the red shift is the spillout of the electrons, as mentioned in Section 3. 
We can analyze this mechanism quantitatively by calculating the optical 
response of the jellium model. There is a computer code, called JellyRpa, 
that I wrote and distributed [11] to calculate the response of spherical jel- 
lium, making no other approximations on the dynamics. Indeed there is an 
effect of the spillout, as may be seen in Figure 7. The peak is shifted from 
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Fig. 7 . Comparison of jellium model of a large sodium cluster with the surface 
plasmon formula and the TDDFT. The transition from the surface plasmon for- 
mula is shown with a vertical line. 



3.4 to 3.0 eV, which is about half of what is needed to explain the empirical 
position. Another possibility is that the local density approximation might 
not be accurate enough. In particular, we saw in Figure 6 that the local 
approximation to exchange could produce serious errors in infinite systems. 
This question was addressed in finite clusters by Madjet et al. [48], who 
examined and compared the different treatments of exchange. They found 
that the LDA exchange was quite satisfactory for sodium clusters, and could 
not be the reason for the discrepancy. 

Lastly, the jellium approximation might be inadequate; the full TDDFT 
of course includes a realistic treatment of the ionic potentials. Comparing 
the full TDDFT with the jellium, we found mixed results. Taking the ionic 
potential from the naive pseudopotential prescriptions in the literature, we 
found a surface plasmon at the same frequency as in the jellium model. This 
is also shown in the figure. However, there is an different prescription for 
the pseudopotential [49], treating the core electrons somewhat differently. 
This gives an additional red shift, essentially bringing the surface plasmon 
down to the observed position. In the end, it is rather disquieting that a 
seemingly small change in the ionic potential would have a very noticeable 
effect on the absorption spectrum. 





86 



Atomic Clusters and Nanoparticles 




Fig. 8. Surface plasmon in Li clusters with N « 150. The prediction from 
equation (3.12) is shown by the vertical line. 



There is an even larger red shift in lithium clusters: the formula gives 
4.6 eV, while the observed peak in the absorption spectrum is at 3.0 eV [51]. 
Here it is much easier to understand how the shift arises from the ionic 
potential. The ionic potential is different for s and p waves because there 
is a core s orbital that is excluded from the valence wave function but no 
corresponding excluded p state. Thus, the ionic potential is effectively more 
attractive for p orbitals. This makes it easier to excite the electrons from 
the ground state, and lowers the excited state energies. This can be seen 
in Figure 8, comparing the jellium model with the full TDDFT calculation. 
As well as substantially shifting the peak position, the TDDFT disperses 
the strength somewhat on the high-frequency side. Both features are seen 
in the experimental data, which is rather well reproduced by the TDDFT. 

The strength of the surface plasmon in sodium clusters, both theoreti- 
cally and experimentally, is very close collective limit with / equal to the 
number of valence electrons. In lithium clusters, the effect of the ionic poten- 
tial may be interpreted as a giving the electron an effective mass m* « 1.3m, 
which also reduces the sum rule by the same factor {e.g. see [50]). 

5.2 Silver clusters 

The surface plasmon formula is also poor for silver clusters, but for a differ- 
ent reason. Here the formula gives 5.4 eV, but the observed surface plasmon 
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is in the range 3. 5-4.0 eV. In this case, it is not justified to neglect the polar- 
ization of other electrons, namely the filled d-shell [52] . In reference [28] we 
calculated spectra of silver clusters including explicitly the d-electrons in the 
TDDFT. The observed peaks in Agg, for example, are nicely reproduced by 
the theory. In Ag^ the experimental spectrum [53] shows a single peak at 
4 eV, having a width of about 1 eV. The theory gives several transitions in 
the range 3. 6-4. 4 eV that together could give a single broad peak of about 
the needed width. Thus the theory reproduces the energetics of the surface 
plasmon rather well. 

I now want to show the role of the d-polarizability in the properties of the 
plasmon from a more analytic point of view. I start with description of the 
polarizability of the d-electrons in the Ag atom. This somewhat unphysical 
quantity was calculated in [28] by the TDDFT keeping the s-electron frozen 
as the ground state orbital. The polarizability of the core came out to be 
Od « 2 « 14 a.u. at the surface plasmon frequency. In that work we 

derived the effect on the surface plasmon using the linear response technique. 
Here I will show another derivation using the dielectric function. We can 
associate a dielectric function with the d polarizability using the Clausius- 
Mossotti formula, equation (3.7), = (1 + 2ad/rg)/(l — ad/rg). For silver 

i~o = i"s = 3.02, giving « 4. Thus the d-orbital polarizability screens 
external fields by a considerably factor. 

Let us redo the dielectric theory of the response including separate con- 
tributions from the s- and the d-orbitals. To derive the dielectric function 
for mixed sets of charges, we go back to the definition of the dielectric func- 
tion and the relation to surface charges. There will be two surface charges 
screening the field, ad and as- In terms of the internal field inside the 
conductor, the surface charges are given by 

Airas^d — i^s,d l)^int- 

The same equation also holds for the total charge and total dielectric func- 
tion, so one can easily derive for the complete dielectric function 

e = Cd + ~ 1- 

Taking Cd from the Clausius-Mossotti formula and from the free electron 
model equation (3.9) the dielectric function becomes 

^^ l + 2ad/rl _ . (5^^) 

l-ad/r^ Lo{uj + ij 

Next we put this into the Mie formula for the polarizability. With a bit a 
algebra the expression becomes (dropping the zy) 

osl-e o3^?(l - adAo) - 

Oi — it T — it 777Z '/ — o ' 

2-l-e ujl{l-ad/rl)-uj^ 



(5.2) 
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Fig. 9. Integrated strength function in Ags comparing Mie theory (solid line) 
with TDDFT (dashed line). 



From the denominator one sees immediately that the resonance energy is 
shifted to 



w' = Ws(l - 

Taking the previous numerical value for e^, this gives oj = 3.7 eV, which 
is almost exactly what is needed to describe the empirical resonance. In 
effect, the charges of the s-electrons are screened by a factor 

I finally want to discuss the strength of the surface plasmon. Unlike 
the alkali metal clusters, the strength is strongly reduced by the d-electron 
screening, at least in theory. Figure 9 shows the integrated strength func- 
tion, Je = dE'S(E') for Agg, calculated in the TDDFT and in the 
Mie theory using the empirical dielectric constant for silver metal. The 
two curves are surprisingly similar. Thus with respect to the theoretical 
polarizability, a small silver cluster seems to behave as a piece of the bulk 
metal. In the Mie theory, the surface plasmon is seen as the step at 3.6 eV. 
The finite system calculation gives a split peak, with the stronger exci- 
tation slightly blue-shifted. The height of the step gives strength of the 
resonance; we see that it is roughly 2, which is a factor of 4 lower than the 
number of valence s-electrons in the clusters. This another manifestation 
of the screening by the d-electrons. It can also be understood rather simply 
by the above polarizability function. In the neighborhood of the resonance. 
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the above polarizability can be expressed 
/?3 / P 

a ~ ~ “dAo) ( r + ~ ^s) 

l \UJ — OJg 

Putting this in equation (3.4), we find for the associated strength, 

f = N{l-ad/rl)^. 

Since the coupling to the external field goes as e^, each charge here is 
screened by a factor 1 — a^/rg. Numerically, the predicted screening is 
a factor of 4, as found in the microscopic calculation. 

The situation would be completely satisfactory except that it does not 
agree with experiment. The experiment of Tiggesbaumker et al. [53] mea- 
sured not only the position of the resonance in AglJ' but also its strength. 
The integrated strength over the resonance is about 4, more than twice 
the theoretical value. I am sorry that I can not give you a resolution of 
this disagreement. But the measurement of total strength is not easy ex- 
perimentally, and so one should not give up the theory without additional 
confirmation. 

6 Carbon structures 

In this lecture I will discuss carbon structures, going from small clusters and 
molecules to fullerenes and nanotubes. A very simple theory, the Hiickel 
model, turns out to be an excellent guide to the electronic structure ob- 
tained by more elaborate means such as the density functional theory. There 
are two requirements for the Hiickel model to be applicable. The first is that 
each carbon and its neighbors lie in a plane. Then the symmetry with re- 
spect to plane allows the orbitals to separated into ct or tt types having even 
or odd reflection symmetry, respectively. This classification is in fact useful 
even if the planar condition is not strictly met, as in the curved fullerenes 
and nanotubes. The other condition is that the tt orbitals are at the Fermi 
surface. This is the case for all the systems I consider. The tt orbitals on 
different atoms couple rather weakly, and the Hiickel models treats the wave 
function by the amplitude of the tt orbital on each carbon, constructing the 
eigenstates from the simple hopping Hamiltonian 

2 

« ii ,h 

Here the sum ji,j 2 runs over pairs of adjacent carbon atoms, s is a spin 
label, and (3 \s & parameter giving the hopping matrix element between the 
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atoms. The hopping parameter can depend on distance d between carbon 
atoms when that varies. It will turn out that a fair parameterization is 
given by [29] 



P{d) = 2.5 



/1.40 A 



2.7 

eV. 



( 6 . 1 ) 



6.1 Chains 

The first structures I want to consider are linear chains. If the axis of the 
chain is in the z-direction, both the Px and Py atomic orbitals will have tt 
character. To find the spectrum within the Hiickel model, we start with 
an infinite system. Here, by the Bloch theorem, the eigenfunctions have 
amplitudes on the atoms that vary from atom to atom as exponentials: 

~ exp(tjAr) (6.2) 

where AT is a parameter of the eigenstate and j indexes the atom in the 
chain. Then <j)K is the solution of the Hamiltonian equation 

H4>k = sk4>k 



with energy 



e_R- = — 2/3cos AT. (6.3) 

To find the solution for a finite chain, I use a trick. Namely, the wave 
function on a finite chain with N atoms at positions j = 1,2, ...,N behaves 
the same under the hopping Hamiltonian as a wave function on an infinite 
chain, provides the wave function vanishes at sites 0 and N + 1. There is 
no hopping from these sites in the finite system because there are no atoms 
there. The vanishing boundary conditions are satisfied by sine functions 
with parameters K = ■nm/{N +1), where m is an integer. The normalized 
wave functions are then 

, ,, I 2 , 7rmj 

y'myjj Y IV+ 1 N+1 

From equation (6.2) the energies of the orbitals are 
€m = -2/3COS 

Let’s see how well this compares with the LDA energies. In Figure 10 we 
compare the Kohn-Sham energies of the tt orbitals in the cluster C7 with 




G.F. Bertsch: Excitations in Clusters 



91 




Fig. 10. Comparison of tt orbital energies in the C 7 cluster. 



equations (1.2-3). The hopping parameter in the Hiickel model was taken 
from equation (6.1) and energy scale was set by aligning the highest occupied 
orbital, m = 3. The occupied orbitals are indicated by black circles in the 
figure. We see that the Hiickel model is quite accurate in the neighborhood 
of the Fermi energy and below. 

A very important derived property is the gap between occupied and 
unoccupied orbitals. For odd-numbered chains, the highest occupied orbital 
has m = {N — l)/2. Thus the gap energy is 



Ae = 2/3 




{N — 1)7T 
2(fV + l) 



— cos 



{N + 1)7t\ 
2(fV + l ) ) 






Asymptotically, the gap for large chains decreases as 



Ae~ 



1 

N ' 



(6.5) 



( 6 . 6 ) 



The comparison of LDA [25] with the Hiickel model as function of chain 
size is shown in Figure 11. There is one complication that shows up as an 
odd-even staggering in the LDA spectrum. The tt orbitals in a chain have a 
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Fig. 11. Systematics of the energy gap and the collective excitation in carbon 
chains, compared with functional fits of the form (6.6) and (6.10). The filled 
circles show the experimental energies of transitions of clusters in a noble gas 
matrix, from references [54,55]. 



two-fold spatial degeneracy (as well as the usual two- fold spin degeneracy) . 
The highest occupied orbital is only half filled for even N, giving them a 
different gap. 

The excitation of an electron from just below to just above the Fermi 
level will have a large transition strength, close to the number of electrons 
in the tt manifold of states. Under these conditions, the frequency of the 
excitation will be strongly perturbed from the gap energy Ae. When the 
TDDFT is applied, the Coulomb interaction pushes the strength to higher 
energy. Of course the total strength is preserved due to the Thomas-Reiche- 
Kuhn sum rule. 

The TDDFT calculated energies of the strong transitions are show as 
the boxes in Figure 11. The shift from the gap energy is factor of 2-3, 
making the transition very collective. The theory describes the experimental 
data (shown by filled circles) quite well. Note that experimental data is 
only available for the more stable odd- number clusters. A ring structure is 
probably more preferable in the midsize even-number clusters. 

A qualitative description of the transition energies can be obtained using 
the matrix version of the TDDFT theory and considering only the single 
state, the excitation across the gap Ae. The matrix version of TDDFT 
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reduces to a simple formula in that case, giving a transition energy u) 

oj = \/ Ae(Ae + 2v). (6-7) 

This formula is the same as equation (4.12) in reference [18], which contains 
a derivation. To find the interaction v, let us assume that it depends only 
on the distance between carbon atoms and has the following form 



v{ji,j2) 



U jl=j2 



d\ji -h\ 






( 6 . 8 ) 



where d is the distance between carbon atoms. The matrix element for an 
excitation from orbital mi to m 2 is then 



u = ^„,(ji)(() 

m 2 i.h)4> mi (j2)<(' m 2 (j2)u(jl,j2) 

31 ,32 



with the factor 4 coming from the degeneracy of the orbitals. Taking the two 
coefficients in equation (6.8) as parameters, the TDDFT calculations shown 
in the figure are fit with parameter values e^/d = 11.7 eV and U = 11.7 eV. 
The value for e^/d is very close to what one would expect from the distance 
between carbons, d = 1.28 A. Note also that the on-site interaction U is not 
stronger than the interaction between neighboring carbons. The reason is 
that the exchange and correlation energies compensate the stronger onsite 
Coulomb interaction. 

It is interesting to examine the asymptotic behavior of to for large chains. 
We already found that Ae has an 1/fV behavior but v has a different de- 
pendence. With the form equation (6.8) for the interaction, and the sum 
over pairs has an asymptotic logarithmic dependence on N coming from the 
1/1 ji — J 2 I term. The logarithmic dependence was noted in [25] and com- 
pared with an analytic formula analytic formula derived from the plasmon 
dispersion in a long wire. 



2 , 1 

uj = log — 

me qa 



(6.9) 



Here a is the radius of the wire, and q is the wave number. For a finite chain, 
we may take q ~ l/N, and the fV-dependence of the excitation becomes 



i/A + BlnN 

N 



( 6 . 10 ) 



One see in Figure 11 that a straight line (corresponding to 1/N in the 
logarithmic plot) gives a good fit to the A-dependence for Ae but not for oj. 
The wire formula other hand describes the A-dependence of uj quite well. 




94 



Atomic Clusters and Nanoparticles 




ie*'" (j), (j). 



O 



<l>ae 
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Fig. 12. Hiickel wave function construction for polyacetylene. 



6.2 Polyenes 

The next system I will discuss are chain molecules have alternating single 
and double bonds between the carbons, the conjugated carbon molecules. 
These are the polyene molecules, beginning with the two-carbon molecule 
ethylene and going to the infinite chain, polyacetylene. The molecules have 
hydrogen atoms as well, which do not play any role in the tt orbitals. As 
before, it is easier to solve the Hiickel Hamiltonian for the infinite system. 
In the polyenes, the distance between the carbons alternates, depending 
on whether the bond is single or double. Thus there will be two hopping 
matrix elements, f3s and f3d- To solve the problem, we divide the chain into 
unit cells containing two carbon atoms attached by a double bond. The 
system is periodic with this unit cell so we can write the wave function for 
the atoms in the cell and adjacent to the cell as shown in Figure 12. 

Let us apply the Hamiltonian equation H(j> = e(j> at two sites in the unit 
cell. This gives the two equations, 

~/3s(j)be~'‘^ - Mb = e<t>a ( 6 . 11 ) 

~Ma - 

The equations are easily solved (multiplying them together) to get a formula 
for the energy. 



ck = ±.Jl3l+l3l + 2MdCosK. (6.12) 

Interestingly, it has a gap at half filling given by 



Ae = 2|/3d-/3,|. (6.13) 

In this case I don’t know a trick to get the finite system spectrum from 
this result. But the Hiickel Hamiltonian is easy to diagonalize numerically, 
with the dimensionality of the matrix equal to the number of carbon atoms. 
Fitting the LDA energies of the 8-carbon polyene, one finds hopping pa- 
rameters of 2.27 and 2.80 eV. These parameterization fits the LDA energy 
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Fig. 13. Transition energy of the collective tt — tt* transition in polyenes, compar- 
ing theory with experiment. The TDDFT is the solid line; the points are various 
experimental measurements. See reference [29] for details. 



gap of all of the polyenes in the range N = 2—40 very well. However, in 
polyacetylene, the predicted gap from equation (6.13) is 1.08 eV, which is 
rather far off the experimental value of 2.1 eV. This is undoubtedly the same 
gap problem of extended insulators that was mentioned in Section 2. The 
problem is seen in the LDA calculations of very large finite polyenes [56]. 
As in the case of the semiconductors, the GW approximation gives a much 
better gap [57]. 

Let us now examine the excitation energies of the strong transitions. The 
results of the TDDFT are compared with experimental data in Figure 13. 
The agreement is rather satisfactory. It would be interesting to analyze 
these results in the framework of the Hiickel model, but I have not done so. 



6.3 Benzene 

We now turn to the benzene molecule, which for the Hiickel molecule is 
a perfect hexagon of carbons (bond length 1.40 A); the outside hydrogens 
hardly affect the tt electrons. I will analyze the orbitals with a poor man’s 
group theory, utilizing only the 6-fold rotation symmetry about the center of 
the hexagon. We can still make use the wave function from equation (6.2), 
but now the boundary condition is that 4>k{Q) = 4>k{Q), giving K = rmrli 
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Fig. 14. Singlet tt — tt* vertical excitation energies of benzene. 

for integers m. Here m has the same meaning as the azimuthal quantum 
number of the rotation group. The wave functions and orbital energies are 
then given by 

TTITT 

Cm = -2/3 cos • (6.14) 

The lowest state has m = 0, and then there are two degenerate states 
with m = ±1. The 6 tt electrons in benzene fill these three orbitals (with 
each orbital containing two spin states). The lowest unoccupied orbital is 
the two-fold degenerate m = ±2 pair. The sixth and highest orbital has 
m = ±3. The state is the same for either sign of m. The energy gap is 
given simply by C 2 — ei = 2/3. 

Four distinct excitations can be made at the energy gap, due to the 
degeneracies of the orbitals just above and below the gap. The particle-hole 
states can be labeled by the change in m required to produce the state. For 
example, the M = -|-3 state has the electron in orbital m = — 1 promoted 
to the orbital m = +2. Thus it appears that there should be two two-fold 
degenerate excitations, M = ±1 states and M = ±3 states. The former are 
have dipole matrix elements with the ground state and will be prominent in 
optical absorption spectrum. The latter two states mix together, due to the 
indistinctness of the ±3 transition density. This gives an energy splitting 
between the states, leading to a spectrum with three excitation energies. 

Figure 14 shows this energetics, and compares with the full scale time- 
dependent density functional theory. The uppermost state (“^ifi„”) has 
a large optical absorption transition strength, and can be described as a 
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Fig. 15. Comparison of experimental and theoretical absorption spectrum in 
benzene, from reference [29]. 



AM = ±1 transition. The next state down is weaker by a factor of ten, and 
is assigned the linear combination | + 3) + 1 — 3) of the AM = ±3 transitions. 
The other linear combination, | + 3) — | — 3), has vanishing transition density 
on the carbon atoms, and is hardly shifted by the interaction from the 
unperturbed excitation energy Ae. The TDDFT theory gives an excellent 
account of these states with no free parameters. In the Hiickel theory, we 
already get the lowest state because it is at the gap energy. It is interesting 
to see how well a simple Hiickel model does for the energy shifts of the 
other states. Taking the Hiickel model for the wave functions in the 
excitation, the matrix element for the (m = 1) ^ (m = 2) excitation is 

V = i(u(0, 0) + u(0, 1) - u(0, 2) - u(0, 3)). 

Taking u(0, 0) from Section 6.1 and the others as u(0, j) = e^/r^, the matrix 
element is 3.6 eV and equation (6.7) gives an excitation energy of 7.6 eV, 
compared to 7.0 eV experimentally. So in this case the simple formula is 
somewhat inaccurate. 

As one goes higher in energy, there are many more states that can be 
excited, and the TDDFT gives a prediction for the entire spectrum. The 
absorption spectrum is shown in Figure 15, compared with the TDDFT 
theory. As stated above, the sharp state at 7.0 is the collective tt — tt* 
transition. Just above, there is some tiny structure in the experimental 
spectrum that reminds one of tuft of grass. These are the so-called Rydberg 
states, having a very loosely bound electron in the Coulomb field of the 
ion. These states cannot be described in the LDA because of its incorrect 
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Fig. 16 . Comparison of tt orbital energies in the Ceo molecule. 



asymptotic potential field. The broad structure centered around 15 eV is 
due to the more tightly bound a electrons. Its center position and overall 
width is correctly described by the theory. However, one can see that the 
theory has additional structure at high energy while the experimental cross 
section is very smooth. This is probably an artifact of the numerical method 
used to calculate the TDDFT, placing the system in a box. 



6.4 Ceo 

The fullerenes are also carbon structures whose lowest excitations are con- 
tained in the tt— tt* manifold. The nearly spherical shape of Ceo implies that 
angular momentum quantum numbers can be used to some extent to classify 
the orbitals. Thus, the lowest orbital of tt character has an equal amplitude 
on all the carbon atoms, corresponding to L = 0. This classification breaks 
down at high L, where the discreteness of the Hiickel Hamiltonian becomes 
visible in the spectrum. The spectrum of tt orbitals is shown in Figure 16 
with assigned L values. We see that the Fermi level splits the L = 5 or- 
bitals. The arrows in the figure show the allowed tt — tt* transitions. The 
total transition strength occupied and empty tt orbitals is about / = 15, 
which is considerably smaller than the number of tt electrons (40). Experi- 
mentally, and in the TDDFT, one sees a group of transitions at about 7 eV 
excitation with a combined strength of about / = 10. Qualitatively, these 
transitions can be understood with the tight-binding model [12]. 
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Fig. 17. Unit cells in the graphene lattice. 



6.5 Carbon nanotubes 

The last stop on my survey of carbon structures takes us up to extended 
nanostructures, the carbon nanotubes. They have very interesting elec- 
tronic properties. Depending on their structure, they can be conductors or 
semiconductors, and one can even observe the individual conduction chan- 
nels. Their properties may be understood with the Hiickel model, at least 
on a qualitative level. 

As we did before, we start with the theory for an infinite system, in 
this case an infinite graphite sheet. We follow the same route we used in 
constructing the wave functions for polyacetylene. Like polyacetylene, there 
are two atoms in the graphite unit cell (see Fig. 17). We label the two atoms 
“A” and “B” and distinguish the atomic amplitudes accordingly. Let us sup- 
pose we have an eigenstate, decomposed into A-site amplitudes 4>A(jii,n2) 
and 4 >B(ni,n 2 ) where (ni,n 2 ) labels the cell in the two-dimensional lattice. 
Then the Hiickel Hamiltonian only connects A and B amplitudes with each 
other. 




From the structure of these equations it follows immediately that the spec- 
trum of eigenvalues comes in pairs with equal and opposite sign. For each 
solution with energy e we can get another state with energy — e by the 
changing the sign of the 4>b's while keeping <j)A the same. This symmetry 
of the spectrum with respect to the sign of the energy was seen in benzene 
and polyacetylene and will be seen again in Manninen’s lectures, and arises 
whenever the lattice can be divided into two equivalent sublattices with the 
Hamiltonian matrix elements zero within each sublattice. 
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We are particularly interested in the states at the Fermi level, which is 
at zero energy because of the half filling of a symmetric spectrum. So our 
task is now to determine the structure of wave functions of zero energy in 
the graphite lattice. There will be several of them, allowing arbitrary linear 
combinations of (/>a and (j)-Q on some particular unit cell. As an example 
we can assume that 0 b ( 0,0) = 0 and 0 a ( 0,0) = 1. We may also assume 
that the state is a Block wave, so we can characterize it with the vector 
= Ki{x — l/\/iy) + A'|'(2y/-\/3) where the two components K\,K 2 
give the phase change moving a cell to the right and to the upper right, 
respectively. Thus 

0A.B(ni,n2) = e*(^i"^+^="^)0A.B(O,O). 

Then all 0 b amplitudes are zero, and equation (6.15) is trivially satisfied. 
The other condition, i?0A = 0, reads as follows when expanded out for the 
B amplitude on the site (0,0): 

~ 0 (0a(1) ~1) + 0a(O, —1) + 0a(O, 0)) = 

- 0 0a(O, 0) = 0. 

The only way to satisfy this equation is to have the three terms in the middle 
expression be the three cube roots of one; thus 

{Kl,Kl) = ^(2/3, 4/3) or ^(-2/3, -4/3). (6.16) 

Since there is no continuous degree of freedom left in the choice of Kf and 
K 2 , the Fermi surface is a set of isolated points. This is the well-known 
structure of the graphite Fermi surface: there is no energy gap between 
occupied and empty orbitals, but unlike a metal the Fermi surface has zero 
area. For other values of {Ki,K 2 ), the energy can be found the same way 
we used for polyacetylene. The result is 

e{K) = ±0a/ ( 1 -I- cos(ATi - K 2 ) + cos K 2 Y + (sin(ATi - K 2 ) - sin K 2 )‘^. 



When K = Kix + K2{xj2 + -\/3y/2)is near a Ky, the result can be approx- 
imated as [59] 



e{K) 



V3 



m-KF\- 



(6.17) 



We are now ready to look at nanotubes. We can construct a nanotube 
by rolling up a sheet of graphite, joining the sides so the hexagons align. 
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Fig. 18. Fermi surface on a carbon nanotube. 



as shown in Figure 18. The various ways of aligning the hexagons can be 
distinguished by counting along the hexagons how far it is around the tube 
to come back to the starting hexagon. The counting is done along adjacent 
hexagons along the two directions corresponding to K\, K 2 above. 

Our e = 0 wave function will wrap around onto itself if the phases match 
on the superposed hexagon. This is the case for the tube in Figure 18, but 
will not be so if the hexagons are displaced one or two in any direction. The 
precise condition is 

KiNi + K 2 N 2 = 2mr 

which from equation (6.16) is equivalent to 

mod(fVi — A^ 2 , 3) = 0. 

If it is satisfied, there is a state at the Fermi energy and thus there is a 
conduction channel. If the condition is not satisfied, there is a gap at the 
Fermi surface and the conduction channels only open when the chemical 
potential is changed to put the Fermi level with the allowed states of the 
channel. 

The density of states can be measured by scanning tunneling microscopy 
and one can see the higher states from the peaks at the thresholds (called 
Van Hove singularies) [60]. In the cited reference, the authors observed a 
number of peaks in a nanotube that they identified as a (13,7) structure, 
nearly symmetrically distributed about e = 0. The ones closest to the 
middle were at e « ±0.9 eV. To see whether this makes sense, let’s put the 
boundary condition for a simpler tube, the (10,10), into equation (6.17). 
The extra state then appears when lOAiFi ± 10A7F2 = 2tt. Taking AKi = 
AK 2 , the condition becomes AKi = tt/10; and the formula gives (3Ki « 
(2.5)(7t/10) « 0.78 eV, which is rather close to the measured value. 
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6.6 Quantized conductance 



The conductance of the individual channels can be derived from the statis- 
tical reaction theory with two assumptions. Let us attach the two ends of 
the nanotube to (electron) reservoirs and measure the current as a function 
of the Fermi levels of the two reservoirs. The first assumption is that the 
electrons travel through the tube independently. Then we can apply the 
statistical theory to the single-electron level densities. The next assump- 
tion is that the electrons go through without reflection and disappear into 
the other reservoir, i.e. the transmission coefficient of the channel is unity. 
It is then very easy to derive the formula for the conductance considered 
as a statistical emission of electron from each reservoir into the other. The 
formula is 



where V is the potential difference of the two reservoirs in volts and Uc is the 
number of open channels, including the two spin states of the electron as two 
separate channels. I will not go through the details in these notes because 
the algebra is just the same as in my two-page published derivation in [12]. 
The staircase behavior of the above expression as the channels open up is 
seen in many conductance measures on nanoscale structures. An example 
of this phenomenon seen in the carbon nanotubes is [61]. This particular 
experiment seems to show a case where rzc = 1, which can only be if one spin 
state of the electron is blocked. To my knowledge, there is no explanation 
for a strong spin dependent transmission. 
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DENSITY FUNCTIONAL THEORY, METHODS, 
TECHNIQUES, AND APPLICATIONS 



S. Chretien^ and D.R. Salahub^’^’^ 



Abstract 

Important concepts of Density Functional Theory (DFT) and the 
different types of approximation for the exchange-correlation energy 
functional are presented. Applications illustrate the advantages of 
using DFT as a computational tool and point out some limitations 
as well. 



1 Introduction 

Computer aided design of new materials and inhibitors with desired proper- 
ties requires the description of chemical reactions in complex environments 
with a high degree of accuracy, especially if one needs to know the effect of 
temperature, pressure, solvation and/or pH on a reaction. At the present 
time, a full quantum mechanical treatment is out of the question, even 
with linear scaling methods [1]. To make this kind of calculation tractable 
on today’s computers, without loss of too much accuracy, the system has 
to be broken into pieces and different computational techniques used for 
the different parts of the model. The combination of quantum mechanics 
and molecular mechanics (QM/MM) using classical force fields is usefull in 
catalysis [2] and in biological systems [3]. For example, the active site of 
a protein, including the inhibitor, can be computed quantum mechanically 
and the rest of the protein by molecular mechanics. In this model, the pres- 
ence of the solvent molecules, could be added explicitly or by a continuum 
model. The challenge with this kind of approach is to make the techniques 
work together. 
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Density Functional Theory (DFT) is often the quantum mechanical 
method of choice because the same accuracy as high-level traditional 
ah initio techniques may be attained but with a fraction of the compu- 
tational time. Larger molecules can be studied with DFT. 

In this lecture we present DFT from a chemical point of view. Chemical 
applications are chosen to illustrate both the successes and the limitations 
of DFT. For more details about electronic structure calculations and DFT 
the following books are recommended [4-7]. 

The first application we have choosen is an ah initio molecular dynamics 
(AIMD) simulation of an alanine dipeptide analogue. This calculation in- 
troduces temperature effects through molecular dynamics in which the clas- 
sical force field is replaced by a quantum mechanical calculation. The study, 
along with others, points to the need to include polarization effects in fu- 
ture molecular mechanics applications. In the second example, we compare 
the experimental pulsed-field-ionization-zero-electron-kinetic-energy 
(PFI-ZEKE) spectrum of V3 with the simulated one. This demonstrates 
the power of DFT to predict equilibrium structures for demanding transi- 
tion metal systems. The third example is about small nickel clusters for 
which one sees the need for attention to technical details for these difficult 
systems that involve close lying states with different magnetic structure. 
The last example concerns preliminary results from a DFT study of the for- 
mation of benzene from acetylene catalysed by iron clusters. Although the 
theory, methods and applications presented in this chapter are, inevitably, 
far from exhaustive we think they will provide a good entry to the literature 
and provide an inkling of what can, and what cannot, be done with DFT 
as it stands today. 

2 Density functional theory 

For a time-independent system, the total electronic energy is obtained by 
the solution of the time-independent Schrodinger equation [8], 

H'^{xi,X2, ■ ■ .,xn) = E^{xi,X2, ■ . -,xn)- ( 2 . 1 ) 

^{xi,X2, ■ ■ ■ ,xn) is the ground-state many-electron wave function and Xi 
contains both the spatial, r), and the spin, ai, coordinates of electron i. 
For a system composed of N electrons and M nuclei the non-relativistic 
Hamiltonian, in the Born-Oppenheimer approximation [9], is given by (in 
atomic units) 



= f + Ke + t>ee. 



( 2 . 2 ) 



This operator is composed of the kinetic energy operator. 
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N 



^ = E 

2=1 




the nucleus-electron attraction operator, 



(2.3) 



K, 



N N M 







(2.4) 



(where Za is the atomic number of atom a) and the electron-electron re- 
pulsion operator, 






N N 

= EE' 

2=1 j>i 



(2.5) 



The many-electron wave function is not an observable and is a complicated 
function that depends on 4N coordinates (xi, yi, Zi and ai for each electron) 
which prevents its visualization in real three-dimensional space. Exact so- 
lution of the Schrodinger equation is limited to a small number of electronic 
systems. On the other hand, the electronic density can be measured experi- 
mentally by X-ray diffraction and depends only on three spatial coordinates 
{x, y and z). Moreover, p(f^ can be obtained from the integration of the 
total many-electron wave function over all coordinates, except the spatial 
coordinate of one electron (the one we are looking at). 



p(fi) = N 



d(Jidx2dxz ■ ■ ■ daiAr|4'(a?i, a?2, ■ • ■ , xn)\^- (2.6) 



This function is a probability distribution that corresponds to the proba- 
bility of finding simultaneously electron 1 with spin cti at position rq and 
electrons 2, 3, . . ., fV in the volume element da? 2 ,da? 3 , . . . ,dxN respectively. 
p{f) is a positive function that goes to zero at infinity (r ^ oo) and gives 
the total number of electrons when integrated over all of space. 



J p{fi)dri = N. (2.7) 

For all the properties mentioned above, p(r) seems to be a good alternative 
to the complicated total many-electron wave function. But, is it theoreti- 
cally justified to use the electronic density as the cornerstone of electronic 
structure calculations? This question was answered by the Hohenberg and 
Kohn (HK) theorems [10] published in 1964. 
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2.1 Hohenberg and Kohn theorems 

The only system-dependent part in the Hamiltonian operator (2.2) is the 
external potential, v{f), because the kinetic (2.3) and electron-electron re- 
pulsion (2.5) operators are fixed once the number of electrons is known. In 
traditional wave function theory, if we know v(f), which fixes H and dr, 
then p{r) is obtained from (2.6). In other words, p{r) is a functional^ of 
v(r^ and dr. Hohenberg and Kohn [10] proved the converse, v(f) and dr are 
unique functionals of p{r). The proof is very simple and Hohenberg and 
Kohn proceed by reductio ad absurdum. Suppose we have two Hamiltonian 
operators, H = T + Vee + f v(r)p(r)dr and ff' = T + 14e + / u'(f)p(r^dr, 
which differ only by the external potential. We also suppose that both ex- 
ternal potentials differ by more than an additive constant but give the same 
ground state density, po{r). Clearly, a different ground state energy, Eq and 
Eq, and ground state wave functions, drp and dr^, is associated with each 
Hamiltonian operator. Using dTg as a trial function for El we can write 



Eo<{%\H\%) = (dr'|H'|dr') + (dr'g|i7-H'|dr') 

= Ag -I- y p{7^[v{r) - v' {r)]dr. (2.8) 

Doing the same operation for E'^ we find 

A' < (d'olff'jd'o) = (d'o|i?|d'o) + (d'o|i?'-H|d'o) 

= Eq — J p{l^[v{r) — v' {r)]dr. (2.9) 

Adding (2.8) and (2.9) we obtain the following contradiction 

E^ + E'^<Eo + E'^. ( 2 . 10 ) 

This means that it is impossible that two external potentials that differ 
by more than an additive constant give the same ground state density. 
Consequently, the external potential is a unique functional of p{r^. Hence, 
the ground state density, po{r^, determines N, v{f), d^g, Eq and all the 
ground state properties. For a given external potential, v{r), the total 
energy expression is a functional of p(f), 

Ev[p{r^] = J v{-r)p{-r)dr + Euk[p{t^] ( 2 . 11 ) 

function requires a number as input and delivers a number as output. A functional 
requires a function as input and, as a function does, delivers a number as output. 
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with 



FHK[p(r)] = (d/|f + t>ee|«') = T[p(r)] + Cee[p(rO]. (2.12) 

Fm^[p(f)\ is a universal functional of p(r) because it is fixed when the num- 
ber of electrons is known. In theory, if we were able to write down the exact 
mathematical form of Fhk[p(?^]) then the ground-state energy and density 
would be known exactly. In practice, we do not know it, but we do know 
that it exists and we know some properties that it should possess. 

In their 1964 paper [10], Hohenberg and Kohn presented a second theo- 
rem, a variational principle with p(f) as the basic variable. The total energy 
functional (2.11) will be minimized if the trial density, p(f), is the ground 
state density 



E^[po{r)] < E^[p{r)] = j v{r)p{r)dr + F[p{Ei\. (2.13) 

The trial density is restricted to be a positive well behaved function re- 
stricted to integrate to the total number of electrons (fV[/5(r)] = f p(r)dr = 
N) and must vanish at infinity. 

These two very simple theorems were the first theoretical justification 
of using p(E) instead of T to determine all the ground state properties. 
But, the first theorem is limited to non-degenerate ground states and it 
is not guaranteed that for all “normalisable” p{E), a physically realisable 
potential exists. This last uncertainty is the u-representability problem and 
is important because we are dealing with real systems in chemistry and this 
external potential determines the nuclear position according to (2.4). In 
the next section, we will see that Levy’s constrained search [11,12] removes 
some of the restrictions met in the Hohenberg and Kohn theorems. 

2.2 Levy's constrained search 

We need a way to select, from all the acceptable trial densities only those 
that correspond to a physical external potential. 

The variational principle (2.13) can be equivalently written as: 

Eq < min < min 

p(r) 

This equation represent a two-step optimization procedure. In the first 
step, the inner minimization, we search all antisymmetric wave functions 
that yield a given trial density. In the second step we search, among all 
acceptable densities, the one that minimizes the total energy. We have an 
equality in (2.14) if the trial density is po{r). Since the external potential 



(^|T+t>e. 



N 

E 

2=1 



u(fi)|T) 



(2.14) 
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is fixed by the density only, and will be constant for all dr this yields the 
same p(r), we can define the universal functional 

Q[p]= min (^If + Ceel^)- (2.15) 

This universal functional corresponds to Thk if the trial density corre- 
spond to po{f). This way of demonstrating the first Hohenberg-Kohn 
theorem removes the limitation to nondegenerate ground states. Moreover, 
using (2.15) replaces the stringent u-representability condition on p(r) by 
fV-representability, which is easier to satisfy. A density is A^-representable if 
it is normalisable (2.7), acceptable and comes from an antisymmetric wave 
function. Using the definition for the universal functional, the variational 
principle (2.14) becomes: 



Eq < min 

p(r) 




v{r)p{f)dr 



(2.16) 



2.3 Kohn-Sham method 

The Thomas and Fermi method [13, 14] was the first density functional 
model. The approximations used were too crude. The simple form for 
the kinetic energy functional, Ttf[p] = Cp f /o(r^®/^dr, derived from the 
homogeneous electron gas, is the source of part of the problem. Kohn and 
Sham [15] decided to compute the biggest contribution to the universal func- 
tional, F[p] as accurately as possible. In this method, the classical coulomb 
repulsion energy, J[p], and the exact kinetic energy for a system of non- 
interacting particles having the same density as the interacting one, Tg[p], 
are calculated explicitly. The remaining part of F[p\ is the exchange and 
correlation functional, E^c\p] (2.18). This functional contains the residual 
kinetic energy (T[p\ — Tg[p]) and non classical electron-electron repulsion 
energy (U,e[p] — J[p\), be. exchange, self-interaction and correlation. 

F[p] = J[p]+Up]+E^,[p] (2.17) 

A,e[p] = T[p] - T,[p] + Uee[p] - J[p\. (2.18) 

To calculate the exact noninteracting kinetic energy, Kohn and Sham [15] 
introduced a set of one-electron orbitals, {pi}, that are eigenfunctions of 
the effective one-electron Hamiltonian 

+ Veff(r) 






(2.19) 
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In terms of these one-electron orbitals, the electronic density is: 

N 

( 2 - 20 ) 

i=l <r=T,i 

Varying E[p] subject to the normalisation constraint (2.7), we find the ef- 
fective potential. 



^eff(rO = ^ f dr' + v{f) + v^c{r^, (2.21) 

2J |r-r'| 

that minimizes the energy expression. 



E[p] = T^[p] + i 



p{f)p{r') 
|r — r 'I 



drdr ^ -|- 



v{r)p{r)dr + £^xc[p(?^]- (2.22) 



Where the exchange-correlation potential, fxc, is given by 

SE^c[p] 

^xc — c 

op 



(2.23) 



Equations (2.19, 2.20) and (2.21) are the Kohn-Sham (KS) equations. They 
have to be solved self-consistently like Hartree-Fock (HF) equations. 
Equation (2.19) can be written in a form similar to the Hartree-Fock equa- 
tion, 



JKS^KS^^KS^KS_ (2.24) 

Even though there is a similarity between KS and HF equations, the eigen- 
functions are different. We will come back to this point in Section 5. 

The Kohn-Sham equations are exact. In principle, we can get the exact 
ground state density. The approximations enter Exc[p] and Uxc for which 
no exact form has been found until now (we will see in Sect. 7 that they 
are expressed by complicated mathematical functions). However, better 
and better approximations exist and this explains the popularity of DFT in 
chemistry. 

In the next five sections, we introduce concepts that should help to 
understand the different approaches and approximations present in different 
exchange-correlation functionals . 

3 Density matrices and pair correlation functions 

The Hamiltonian operator (2.2) contains only one- and two-electron oper- 
ators which means that we can express the total energy in terms of the 
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probability of finding one electron and pairs of electrons in space. In this 
section, we introduce the concept of first- and second-order spinless density 
matrices. For more details on density matrices, we recommend the book of 
Davidson [16]. 

Focusing on the energy, one can greatly reduce the number of variables 
in the Schrodinger equation by integrating the product of the many-electron 
wave function with its complex conjugate form over all variables, except the 
spatial coordinates of two electrons, and summing over spin. By doing this, 
we obtain the second order spinless density matrix. 



p(fi,f2,r'i,r2) 



N{N- 1) 
2 



'i>{ri,ai,f2,(J2,X3, . . . ,xn) 



4'*(r ^,(Ti,r 2,(72, X3, • ■ • ,XAr)d(Tid(J2dx3 . . .dfAr.(3.1) 



The pair density matrix. 



P2{ri,r2) = P2{ri,r2,ri,r2) (3.2) 

|4'(Fi,f2, • ■ • ,XAr)pd(Tid(T2df3 . . .dxN, 

is the diagonal element of the second order density matrix, ^ 2 ( 74 , 74, 7 ^ 2 ), 

and corresponds to the probability of finding a pair of electrons, one at fq 
and another at r 2 , when the system is in the state represented by 4'. We 
introduce a short-hand notation. The subscript of p refers to the order of 
the matrix and we write P 2 (74,74) instead of P 2 (74, 74, 74, 74) while keeping 
in mind that r j = f) and r 2 = ? 4 . This probability distribution contains 
all the information about electron correlation and gives the total number of 
distinct pairs of electrons when integrated over the whole space with respect 
to ri and T 2 - Moreover, one can show that the probability of finding two 
electrons with the same spin at the same place in space, ^ 2 ( 74 , 74 ), is zero, 
which satisfies the Pauli principle. 

The pair density matrix can be equivalently written as: 

P2{ri,r2) = p{ri)p{r2)[l + h{fi,f2)]. (3.3) 

^( 74 , 74 ) is a hole function. We will discuss its physical meaning in the next 
section. From (3.3), if h{ri,r 2 ) = 0, the pair density matrix corresponds 
to the limit of two uncorrelated charge distributions, p{ri) and p{r 2 ), so 
^(74,74) represents the departure from this uncorrelated limit. The hole 
function contains all the information about nonclassical effects including 
electron correlation (14e[p] — J[p]) and the residual kinetic energy (T[p\ — 

Ts[p]). 



N{N -1) 
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The first-order spinless density matrix is obtained by integrating the 
product d/v]/* over all variables, except the spatial coordinates of one elec- 
tron, and summing over spin. 






N 



'i>{fi,ai,X2, ■ ■ -,xn) 



'i'*{r[,ai,X2, • • • , XN)daidx2 ■ ■ ■ dxN- 



(3.4) 



The diagonal element of this matrix, pi(ri, ri), is the electron density (2.6), 



piin) = Pi{fi,fi) = N 



\d>{ri,X 2 , . . • , XN)\^da\dx 2 dxz ■ ■ ■ da; at. 



(3.5) 

In terms of first- and second-order spinless density matrices, the total energy 
is given by: 



E 



(?4 , ) , P 2 (ri , ^ 2 , r i , r 2 )] 




v{r)pi{r)dr 



(3.6) 



4 Adiabatic connection or coupling strength integration 

The adiabatic connection method [17-20] represents one of the most impor- 
tant concepts in DFT. We start by introducing a coupling strength param- 
eter, A, that varies between 0 and 1 into the universal functional defined by 
the constrained search (2.15), 

Fx[p]= min (^|f + Al>ee|^) = (^^iT+AKel^^)- (4.1) 

This parameter switches OFF (A = 0) or ON (A = 1) the electron-electron 
interaction. The equation above implies that for each value of A, the many- 
electron wave function must integrate to the exact density. In other words, 
p{f) is independant of A. This is achieved by adjusting the external potential 
for each value of A. In the two limiting cases, the universal functional is. 



Fo[p] = n[p] 

Flip] = T[p] + V,,[p\. 



(4.2) 

(4.3) 
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With these definitions, the exchange-correlation functional (2.18) can be 
written as: 

[p] = F, [p] - Fo [p] - J[p\ = - J[p] . (4.4) 

This formula connects the Kohn-Sham non-interacting reference system 
(A = 0) with the real fully interacting system (A = 1) by an integration over 
A. The integrand for different values of A is evaluatated at the real (fully 
interacting) density. This process is called the adiabatic connection because 
of the analogy to an adiabatic process in thermodynamics. Here, one goes 
from A = 0 to A = 1 by infinitely small variations in A at constant density 
instead of constant energy. 

Moreover, using the Hellmann-Feynmann theorem, one can show, 

= (^^IKel^^)- (4.5) 

Summing over spin and integrating this definition over all spatial coordi- 
nates except two, ri and fg, we introduce the A-dependent second-order 
spinless density matrix into the definition of Fxc, 

E^c[p\ = j J dAp2(ri,r2) - J[p]. (4.6) 

Using the pair density matrix definition (3.3) and performing the integration 
over A brings in the A-averaged hole function, h{ri,f 2 ), 

dAp^(ri,r2) = ip(ri)p(r*2)[l -f Ji{ri,r2)]- (4.7) 

Finally, introducing the definition of the exchange-correlation hole function, 

Pxc{ri,r2) = p{r2)Hri,r2), (4.8) 

we obtain the following definition of the exchange-correlation energy func- 
tional. 



Exc[p] = ^ J J |^^^^^P(F)Pxc(ri,r2). (4.9) 

The A-averaged exchange-correlation hole, Pxc(7iU2), is very important be- 
cause it fixes the exchange-correlation energy, it contains all the information 
about the non-classical electron-electron interaction as well as the residual 
kinetic energy, and corresponds to the probability of finding an electron at 
7*2 when the reference electron is located at r). The hole function moves 
with the reference electron and we talk about a hole because it reduces 
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the probability of a second electron coming too close to the reference one. 
The hole function digs a hole in the probability distribution of the second 
electron near the location of the reference electron. 

The exact form of the exchange-correlation hole function is unknown 
but we do know some properties that it should posseses. For example, the 
hole contains exactly one electron, 



Pxc (ri,r2)dr2 



p(r2)/i(ri,r2)dr2 = -1. 



(4.10) 



This property is known as the sum rule. 

As we will see in Section 7, it is convenient to separate the exchange- 
correlation energy in terms of separate exchange and correlation functionals. 
This yields an exchange hole function. 



Px(ri,r2) 



1 \pi{ri,r2)? 

2 p{ri) 



also called the Fermi hole and a correlation hole. 



(4.11) 



Pc{ri,r2) = Pxc(ri,r2) - Px(^i,r2), (4-12) 

also called the Coulomb hole. From these definitions, the exchange and 
correlation holes have the following properties: 



Px(ri,ri) = -p(ri) (4.13) 

Px(ri,r2)dr2 = -1 (4.14) 

Px(ri,r2) < 0 (4.15) 

Pc(ri,r2)dr2 = 0. (4.16) 



Equations (4.13) and (4.14) have an important physical meaning; 
equation (4.13) corresponds to the Pauli exclusion principle and (4.14) 
is the self-interaction repulsion correction. For a graphical representation 
and the spin-dependent form of the exchange and correlation hole functions, 
see the review article of Ziegler [21]. 

Up to this point, we still have an exact theory. Approximations are 
introduced when one introduces models for the exchange-correlation hole. 
For the construction of a functional, we need to know the A-average of this 
hole. 

The adiabatic connection formula provides a means to incorporate more 
and more properties of the exact Kohn-Sham exchange-correlation func- 
tional into a new approximate functional. In a sense it plays an equivalent 
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Table 1. Comparison between KS-DFT and HF-Cl. 



HF-CI 


KS-DFT 


Reference 




HF, p^^{r) 


KS, p(rO 


Exchange 






($KS|-14^|$KS^_ 


Correlation 




E®xact[p] _ £:HF[^HF] 


Ey,c[p\ - Ey\p] 


Correlation components 


Tcorr = _ ^HF^^HF] 


TM = - T,[p] 


wNE _ r (p(r)-p^’"(r))Zc 

•''^corr I-Rq— ri| 

wcoul _ f f Ip(ri)p(f 2 )-p^^(fi)p^^(f 2 )] 

*^corr J J |ri— r 2 | ci/ici/2 




Everything else 


Ec[p{r)] 



role as adding more and more determinants in ab initio methods, to include 
more correlation. 



5 Comparing and constrasting KS-DFT and HF-Cl 

In this section we present the formal differences between the definition of the 
exchange and correlation energies used in DFT and in traditional quantum 
chemistry or ab initio methods. The characteristics of both methodologies 
are summarized in Table 1. 

The first major difference, is the reference system. In the ab initio world, 
the HF equations, and the accompanying HF density, are taken as 

the reference. In KS-DFT, we use a fictitious non-interacting reference 
system but the reference density is the exact one. 
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In ab initio methods the exchange energy is computed exactly using the 
HF orbitals, 







^ja in ) (^2 ) (Pka (r 1 )(pja{r2) 

\ri - r2\ 



dridr2. 



(5.1) 



The same definition can be applied to KS-DFT using the KS orbitals, 
This is how one gets a part of the exchange energy in hybrid 
methods (see Sect. 7.5). This way of calculating the exchange energy gives 
rise to a nonlocal® potential which is in opposition to pure DFT methods 
where local potentials are used in the definition of Ueff(r) (2.21). However, it 
is possible to calculate the exact KS-DFT exchange energy through a local 
exchange potential using the optimized effective potential (OEP) method 
(see Sect. 7.6). In Table 2 we compare the exact exchange energy computed 
with this approach and the exact HF energy. The difference between these 
two quantities increases with the atomic number. 

Traditionally, the correlation energy is defined as the difference between 
the exact many-electron wave function energy (without relativity) and the 
HF energy. Importantly, two densities are involved, the Hartree-Fock den- 
sity and the exact density. With this definition, there are three major 
contributions to the correlation energy in ab initio methodology. The first 
contribution is a kinetic energy component. The exact kinetic energy in- 
volves the exact many-electron wave function, corresponding to the exact 
density. To calculate Tcorr, one subtracts the Hartree-Fock kinetic energy. 
The last two contributions arise from the fact that we use p^^(r^ to com- 
pute the nuclear-electron and the electron-electron coulomb energy. These 
two contributions are absent in KS-DFT because we use the exact den- 
sity throughout. Moreover, the kinetic energy component in the KS-DFT 
correlation energy does not correspond to the one previously defined. It 
corresponds to the difference between the exact kinetic energy of the real 
system and the exact kinetic energy of a fictitious non-interacting reference, 
both evaluated at the exact ground state density. The KS-DFT correlation 
energy contains also a contribution from the electron-electron interaction, 
Tfxc, beyond the classical coulomb interaction and the exact KS-DFT ex- 
change definition. 

In Table 3, we compare the exact correlation energy from KS-DFT and 
traditional quantum chemistry for atoms. One clearly sees that the following 
relation [26] is respected 



I^QC(exact) | 



<\E, 



'DFT(exact) i 



(5.2) 



definition of local and nonlocal terms is given in Section 7.1. 
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Table 2. Comparison of exact atomic HF and KS-DFT exchange energies (from 
Ref. [22]) with exchange energies obtained from approximate exchange fnnctionals 
(from Ref. [23]) (in Hartrees). The corresponding error relative to is given 
in parentheses (in %). 







^KS 


^LDA 


^PW91 


He 


-1.026 


-1.026 


-0.884(-13.82) 


-1.017(-0.88) 


Be 


-2.667 


-2.666 


-2.312(-13.26) 


-2.645(-0.77) 


Ne 


-12.108 


-12.105 


-11.033(-8.85) 


-12.115(0.08) 


Mg 


-15.994 


-15.988 


-14.612(-8.61) 


-15.980(-0.06) 


Ar 


-30.185 


-30.175 


-27.863(-7.66) 


-30.123(-0.17) 


Ca 


-35.211 


-35.199 


-32.591(-7.41) 


-35.165(-0.10) 


Zn 


-69.641 


-69.619 


-65.645(-5.71) 


-69.834(0.31) 


Kr 


-93.856 


-93.833 


-88.624(-5.55) 


-93.831(0.00) 


Sr 


-101.955 


-101.926 


-96.362(-5.46) 


-101.918(-0.01) 


Cd 


-148.914 


-148.879 


-141.543(-4.93) 


-148.885(0.00) 


Xe 


-179.097 


-179.063 


-170.566(-4.75) 


-178.991(-0.04) 


Ba 


-189.100 


-189.065 


-180.241(-4.67) 


-189.034(-0.02) 


Yb 


-276.214 


-276.145 


-265.563(-3.83) 


-276.939(0.29) 


Hg 


-345.304 


-345.244 


-332.143(-3.80) 


-345.427(0.05) 


Rn 


-387.504 


-387.450 


-372.981(-3.74) 


-387.417(-0.01) 



Table 3. Comparison of total correlation energy (in Hartree) (from Ref. [24]) and 
the kinetic correlation energy (from Ref. [25]) obtained from Density Fnnctional 
and traditional Quantum Chemistry. 





^DFT 


EQC 


Tc[p] 


H" 


-0.041995 


-0.039821 


0.0280 


He 


-0.042107 


-0.042044 


0.0366 


Be2+ 


-0.044274 


-0.044267 


0.0415 


Ne®+ 


-0.045694 


-0.045693 




Be 


-0.0962 


-0.0943 


0.0737 


Ne 


-0.394 


-0.390 





We compare also the kinetic energy contribution to the correlation energy. 
In ab initio methods, the virial theorem, 



^coor — A-coor , 



(5.3) 
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is satisfied. Then, the absolute values of the second column give Tcoir- Once 
again, we see that the corresponding values in KS-DFT are different. 

Through Koopmans’ theorem, the eigenvalues of the HF orbitals have 
a physical meaning for closed-shell molecule. They correspond to minus 
the energy required to remove an electron from that orbital. Values ob- 
tained from Koopmans’ theorem are always bigger than the experimental 
values obtained from photoelectron spectroscopy. This is so because, in this 
approximation, we use the same orbitals to describe the N and the N — 1 
electron systems. Clearly, the V — 1 electron system must be relaxed to take 
into account the reduction of the sceening effect created by the removal of 
one electron. Further discrepancy is due to the lack of correlation in the 
HF method. The Koopmans’ theorem does not apply to DFT. However, 
it has been proved [27, 28] that, using the exact exchange-correlation po- 
tential, the eigenvalue of the highest occupied molecular orbital (HOMO) 
of the neutral molecule corresponds to minus the ionization potential (IP) 
and the corresponding eigenvalue in the negatively charged species gives 
minus the electronic affinity (EA), 



£homo(.^) = —IP (5.4) 

£homo(A^+1) = -EA. (5.5) 

In Tables 4 and 5 of Section 7.3, atomic eigenvalues of the HOMO obtained 
from various approximations are compared with experiment. 

The last difference that we want to bring to the attention of the reader 
is the presence of the self interaction energy in approximate exchange- 
correlation functionals. In the HF method the electron-electron repulsion 
energy is, 

N N 

= (5-6) 

i=l j=l 

where 

Jij = J J ^*(D)^j(^2) ]_ ^i(ri)(()j(r2)dridr2 (5.7) 

Kij = J J (t>iin)(t>j{r2) ]_ ^j(Ei)^i(r2)dridr2. (5.8) 

The self coulomb energy, Ju, and the self exchange energy, Ku, arise because 
i could equal j in (5.6). These two terms cancel each other in HF theory as 
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they should. On the other hand, the coulomb energy in DFT is computed 
using the density, 



and contains self interaction contributions. If one uses the exact exchange 
correlation energy functional, the self coulomb energy will be exactly can- 
celed by the self exchange energy as in HF. However, because we necessarely 
use approximate functionals there is a contribution from the self-interaction. 
This is one reason that the total energy in DFT is higher than in HF. 

6 Preparing new functionals 

Considering the adiabatic connection procedure one can formulate a recipe 
to prepare new exchange-correlation functionals that can be summarized as 
follows: 



• propose a model for ft,(ri,r2); 

• choose properties that h{r\, r^) should fulfill. For example, normalisa- 
tion, perfectly self-interaction free, contains opposite and parallel spin 
correlation, various scaling relationships. In principle, these proper- 
ties can be derived from the exact wavefunction; 

• determine its A dependence and integrate over A; 

• test it on a set of reference data. 



In principle, this recipe is straightforward but, in practice, the various steps 
present substantial difficulties. 

The choice of properties for h{r\,r2) may be influenced by the nature of 
the applications on which we want to apply the functional. For example, the 
asymptotic condition is clearly important if one is interested in properties 
that depend on the long range behavior of Uxc such as the calculation of 
excitation energies. This property is not so crucial if one is interested in the 
calculation of electronic properties that do not depend on the unoccupied 
orbitals. 

In practice, it might seem very hard to determine the A dependence of 
ft,(ri,r2). The adiabatic connection formula (4.9) per se does not explain 
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how to do this. However, Levy [29] showed that by homogeneous scaling of 
the density using a coordinate factor 7 (where 7 is 1/A), 

jy, jz), (6.1) 

we can express the adiabatic connection method as: 

if.e[p] = ifx\[p]dA = Aif4=l [pi/;,]dA. (6.2) 

Jo Jo 

This formula shows that we need only know to get the value for 

all values of A’s. This equation implies also that knowing the value of [p] 
for some small value of A suffices to generate all the values of [p\ ■ In his 
contribution. Levy showed that the exact exchange-correlation hole satisfies 
the same scaling property, 

Pxc (71^1 >71^2) = I^pl7^(.iri,-ir2). (6.3) 

In other words, once we have obtained a form for h{ri,r 2 ) that satisfies 
some choosen properties, one gets its A dependence through the application 
of the coordinate scaling approach using (6.3). 

Finally, it is very important to test the new functional on a set of ref- 
erence data to assess its accuracy. For this purpose, many functionals in 
the literature are tested on the popular G2 [30] or, more recently, G3 [31] 
reference set. This set is composed of experimental values known with high 
accuracy such as ionization potentials and atomization energies. However, 
it is not a priori the best set for all of chemistry because, for example, there 
is a lack of transition-metal systems, molecules containing hydrogen bonds 
and other intermolecular interactions and no transition states are included 
in this parametrization set. 

In Section 8 we will present the LAP functional which is an example of 
a functional developed according to this procedure. 

7 Approximate exchange and correlation functionals 

The following will give the flavor of the evolution of exchange-correlation 
functionals and present a few representative examples. A complete listing 
is out of the question®. One could refer to the work coming from the labo- 
ratories of Becke, Perdew and Scuseria, for example, as active laboratories 
working on the development of new exchange and correlation functionals at 



®An extensive listing can be found at: 
http: //www. centrcn.umontreal . ca/^casida/ compchem/DFT.html 
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the present time. We recommend some of their latest publication [32-34] 
and the references therein as entries into the exchange-correlation literature. 

Before presenting the different approximations for ifxc, we should explain 
the difference between the terms local and nonlocal used in the literature 
because it could be a source of confusion. From a mathematical point of 
view the majority of exchange-correlation functionals, with the exception 
of the hybrid schemes, are local functionals because they depend on the 
density and its derivatives only at a given position in space. In contrast, 
the HF exchange energy (5.1) and the exact KS exchange energy func- 
tional are nonlocal because they also need information on other positions as 
well. The term nonlocal is often used to designate the Generalized Gradient 
Approximation (GGA) functionals. This is so, because if one thinks of a 
Taylor expansion of a function, then one can obtained information in the 
vinicity of the current position using the local information on the function 
and its derivatives. In the present chapter, we take the first definition and 
reserve the term nonlocal for mathematically nonlocal functionals. 

Usually the exchange-correlation energy functional is divided into ex- 
change and correlation parts. 



Eyic — Ex + Ec- (f.l) 

We will present each contribution separately. The tilde over indicates 
that from now on we are dealing with approximate E^c functionals. 



7.1 The Local Spin Density Approximation (LSDA) 

In order to develop approximations, we need a model as a starting point. 
In the majority of the exchange-correlation functionals, the homogeneous 
electron gas is used. Particularly, in the local density approximation (LDA) 
the density, p{r), is considered at each point in space and the homogeneous 
electron gas model is applied locally using the density only. This procedure 
is represented in Figure 1. In the local spin density approximation (LSDA), 
the electron density for individual spin components, Pa{r) and ppir), are 
used. 

Using the homogeneous electron gas model, the exchange energy func- 
tional is know as Dirac exchange [35] and has a very simple mathematical 
form, 

^LSDA = -^ J [p„(r)4/3 + p^(r0^/3]dr. (7.2) 
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Fig. 1. Schematic representation of the local density approximation. 



The general form of the local spin density approximation for the correlation 
energy functional has the following form, 

Pfs] = J p{f)ec{pa{r^,P0ir))df, (7.3) 

where edpaif^, Ppi^)) is the correlation energy per electron in a homoge- 
neous electron gas. The exact form is unknown but many approximations 
to £c(/9a(r), p/ 3 (r)) exist. The most popular is due to Vosko, Wilk and 
Nusair (VWN) [36] who used a Fade approximation to interpolate the re- 
sults from a Monte Carlo simulation performed by Ceperley and Alder [37] 
on a homogeneous electron gas. 



£c(7’s) 



A < In 





Q \ 

2x + b) 



bxn 



In 



X{xo) 

2(6 -|- 2xq) 

Q 



V X{x) ) 



■tan ^ 



Q Y 

2x + b J 



(7.4) 



Where X{x) = x^ + bx + c, Q = (4c— 6^)^/^, Xo, b and c are constants. The 
Wigner-Seitz radius, Cg, is related to the density by 



r. 



3 

47T/9 



1/3 



(7.5) 
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LSDA gives fairly good results for equilibrium geometries and vibrational 
frequencies for covalently bonded molecules but has a tendancy to overbind 
atoms because the molecule is overstabilized compared to the separate 
atoms. Moreover, the results are bad for molecules containing hydrogen 
bonds [38] and van der Waals complexes. 

Part of the success of LSDA may be traced to the exchange hole function 
which satisfies (4.13) and (4.14) [18]. Moreover, the symmetrically spherical 
form of the exchange-correlation hole function, needed in the calculation of 
Exc, is a good approximation to the exact one, even though the hole itself 
does not reproduce well the exact hole function [39] . 

In LSDA, the exchange energy is typically underestimated by ~10% and 
the correlation energy overestimated by ~100%. The total energy is too 
high, the gap between occupied and unoccupied orbitals (or bands in solid 
state physic) is too low. Moreover, LSDA favor the d"+^s^ configuration 
over the d”s^ in the 3d transition metal atoms [40]. Gunnarsson and Jones 
[41,42] showed that the major source of error in LSDA comes from the 
exchange energy. A source of error in the calculation of the exchange energy 
is the self-interaction repulsion present in LSDA. This error is related to the 
fact that an electron sees all other electrons including itself. The presence 
of such an error is easy to verify, since the exchange energy of any one- 
electron system should be zero. Perdew and Zunger [43] have proposed 
a self-interaction corrected LSDA functional that reduced the error below 
3% for the exchange energy. Improvement was also obtained in the LSDA 
correlation energy, in the total energy, in the orbital eigenvalues, in the long 
range behavior of v-xc(f), in the shape of the exchange-correlation hole and 
consequently in the electronic density. 

7.2 Gradient Expansion Approximation (GEA) 

The LSDA approximation is useless for thermochemistry. This model was 
developed in the slowly varying density limit, which is far from the case 
in atoms and molecules. A logical next step is to include higher deriva- 
tives of the density to take into account the inhomogeneities. The gradient 
expansion approximation of the homogeneous electron gas is the following. 



pnGEA 

^xc 



\Pon Pp\ 



J p{r)e^c{pa{^, Pldi'i^)dr 



+ E E C^f'{Po.{r^,Pp{f^) 

(T=a,/3 (7 '—oc,(3 
+ ... 



^Pa ^Pa' 

pV\E) p^Jf{r) 



(7.6) 



The second-order gradient expansion aproximation (GEA) is obtained by 
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keeping only the first two terms of the series. The exchange hole obtained 
from the second-order expansion does not satisfy (4.13) and (4.14) and for 
that reason, little improvement over LSDA is obtained. Indeed, in practice, 
results are often worse than for LSDA. 

7.3 Generalized Gradient Approximation (GGA) 

Exchange and correlation functionals that use the second-order expansion 
and manage the difficulties met in the GEA, are know as generalized gradi- 
ent approximations (GGA). It is convenient to express the GGA exchange 
energy as: 



eGGA ^ ^LDA - ^ / F{s.)p^J,\r^dr. (7.7) 

a— a., (5 

The argument of the function F, Sa^ is the reduced density gradient for spin 



Sa(r) 



IVP^I 

4/3/-A ’ 
Pa [r) 



(7.8) 



and corresponds to a local inhomogeneity parameter. The GGA functionals 
have the following general form, 



E^.^^[P\= I /(p(r1,Vp(rO)dr. (7.9) 

Perdew [44] used the second-order expansion with a cut-off radius to im- 
pose conditions (4.13) and (4.14) to the exchange hole function. With this 
procedure, he was able to compute atomic exchange energies within 1% of 
the exact HF exchange energy. Further simplification of the model led to 
the well-know Perdew- Wang86 (PW86) [45] exchange functional. 



^PW86 



l-f 1.296 



0.2 



(247t2)1/3 



14 



(247t2)1/3 



(247t2)1/3 



1/15 



(7.10) 



which is parameter free. 

Another popular GGA exchange energy functional is due to Becke (B88) 
[46]. Becke imposed the correct — 1/r asymptotic behavior for the exchange 
energy density Ex and used one parameter, (3, fitted on the exact atomic 
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HF exchange energy of noble gas atoms. 

FB88 ^ Psl 

1 + 6/3scrSinh ^(s^) 

The GGA correlation functionals have more complicated mathematical 
forms. We limit ourself to mention the well-known Perdew86 (P86) [47,48] 
correlation functional that contains one fitted parameter and the Perdew- 
WangQl (PW91) functional [49-51] that is parameter free. This last func- 
tional has an exchange counterpart. 

Becke [52] showed that the inclusion of GGA exchange and correlation 
functionals using the combination B88 for exchange and PW91 for corre- 
lation reduced the absolute error and the overbinding tendancy of LSDA. 
The average absolute error in the atomization energy on a set of 56 small 
molecules was reduced from 35.7 kcal/mol to 5.7 kcal/mol. The improve- 
ment was not as important for the ionization energy of 46 small molecules. 
The average absolute error was reduced from 5.3 kcal/mol to 3.5 kcal/mol. 
With GGA exchange-correlation functionals we get better hydrogen bonds, 
good thermochemistry but still not chemical accuracy. Further improve- 
ments are needed for the van der Waals complexes and activation energies. 

The LDA exchange-correlation potential does not have the right asymp- 
totic behavior at long distance [53,54]. For that reason, equations (5.4) and 
(5.5) are not satisfied as can be seen from Tables 4 and 5. From Table 5 it is 
demonstrated that the HOMO has a positive eigenvalue (unbound) for an- 
ions. This is the case also for all negatively charged molecules. Adding the 
gradient correction does not lead to significant improvement since the bad 
asymptotic behavior of LDA is not corrected by the addition of gradient 
corrections [53,54]. For example, the exchange functional of Becke, B88, 
falls off too fast asymptotically (— 1/r^ instead of — 1/r) [22]. Moreover, 
Engel et al. [22] and van Leeuwen and Baerends [55] showed that it is im- 
possible for a GGA functional defined as (7.7), i.e. containing only Scr(r)) to 
satisfy simultaneoulsy the asymptotic behavior of the exchange-correlation 
energy functional and the exchange-correlation potential. These conditions 
can be fulfilled by the addition of the Laplacian of the density into the 
function F. Using an asymptotically corrected exchange-correlation poten- 
tial like LB94 [55] improves the results (see also Ref. [56]). Asymptotically 
corrected potentials are also essential for a proper description of electronic 
excitation spectra [57]. In Table 4 we present the value of — Ehomo obtained 
with the exact KS-DFT exchange potential (without correlation) which has 
the right asymptotic behavior. In DFT, in contrast to HF, no correction due 
to electronic relaxation has to be added. Discrepancies between — e§o^O 
and experimental IPs are due to the lack of correlation in the calculations. 
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Table 4. Comparison of the ionization energies (in a.u.) obtained from the highest 
occupied molecular orbital energies (HOMO) with the experimental ionization 
potential (IP). (HF, experimental, LDA, B88P86 and LB94 results are taken from 
Ref. [55] and OEP from Ref. [58]) 



Atom 


— ^homqHF 


IP 


,OEP 

^HOMO 


^LDA 

^HOMO 


,B88P86 

^HOMO 


XB94 

%OMO 


H 


0.500 


0.500 




0.234 


0.280 


0.440 


He 


0.918 


0.903 


0.918 


0.571 


0.585 


0.851 


Be 


0.309 


0.343 


0.309 


0.206 


0.209 


0.321 


Ne 


0.850 


0.792 


0.851 


0.490 


0.496 


0.788 


Ar 


0.591 


0.579 


0.591 


0.381 


0.380 


0.577 


Kr 


0.524 


0.517 


0.523 


0.346 


0.344 


0.529 


Xe 


0.457 


0.446 


0.456 


0.310 


0.308 


0.474 



Table 5. Comparison of the electron affinity (EA) (in a.u.) obtained from the 
highest occupied molecular orbital energies (HOMO) with the experimental EAs. 
(From Ref. [55]) 



Anion 


EA 


,LDA 

^HOMO 


,B88P86 

^HOMO 


XB94 

^HOMO 


P- 


0.125 


-0.097 


-0.099 


0.128 


Cl- 


0.133 


-0.022 


-0.023 


0.140 


Br“ 


0.124 


-0.008 


-0.009 


0.140 


I" 


0.112 


-b0.005 


-b0.004 


0.139 



7.4 meta-Generalized Gradient Approximation (meta-GGA) 

We can go beyond the GGA approximation by adding an explicit depen- 
dence on the Laplacian of the density, V^p(r), and the kinetic energy 
density^, 

occ 

T.(r) = ^|V(^f(r)p, (7.12) 

for the calculation of exchange-correlation energy. These two quantities ex- 
hibit atomic shell structure and may be viewed as inhomogeneity 



^We should mention that different definitions exist in the literature for the kinetic 
energy density, here we take the definition of Becke [59] . 
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parameters used to get nonlocal information about the exchange-correlation 
hole function. 

Exchange-correlation energy functionals that could be written in the 
following general form are commonly called the meta-generalized gradient 
approximation (meta-GGA) , 

^meta-GGA[^] = J /(p(r), Vp(r) , V2p(r ) , r(rO)df. (7.13) 

Adding the kinetic energy density into the exchange-correlation energy func- 
tional brings extra computational work in the KS calculation. In the SCF 
procedure, the exchange-correlation potential, Uxc(tO> has to be calculated. 
The problem arises because r is not an explicit functional of the density. 
Thus, approximations have to be programmed. 

Among this kind of functional we mention the exchange energy func- 
tional of Becke and Roussel [60] , the exchange-correlation energy functionals 
of Perdew et al. [33], Van Hoorhis and Scuseria (VSXG) [61] (which contains 
21 parameters...), and the correlation energy functionals of Proynov et al. 
(LAP, Bmrl) [62-67] and of Lee, Yang and Parr (LYP) [68]. This family 
of functionals solve the majority of the problems met with GGA. Part of 
the success lies in the orbital-dependent exchange-correlation energy. meta- 
GGA exchange-correlation energy functionals seem to be a good alternative 
to the GGA and hybrid schemes which are the subject of the next section. 

7.5 Hybrid functionals 

Another kind of approximation to go beyond the GGA was proposed by 
Becke [69]. He suggested to mix some of the exact KS exchange energy 
with a traditional exchange-correlation energy functional. His argumenta- 
tion was based on the adiabatic connection formula. He showed [70] that 
the local electron gas model is inapropriate near A = 0 and used the H 2 
molecular system to illustrate this. At A = 0, the noninteracting limit, the 
exchange-correlation energy is the exact KS exchange energy without corre- 
lation. For H 2 , the exact exchange hole function, is the negative 

of the (Jg orbital density and, consequently, is symmetrically delocalized over 
both atoms. On the other hand, the exchange hole function of the local elec- 
tron gas model is more or less localized by its nature. This hole is a bad 
approximation for the exact hole near the A = 0 limit. As a remedy, Becke 
suggested to use the exact KS exchange energy at A = 0 and LSDA [69] or 
GGA [70] at A = 1. As an improvement of his first half-and-half model [69] 
he proposed [70], 

£,B3PW91 ^ £,LSDA ap(^Exact _ ^LSDA) Ox -k , (7. 14) 
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where oq = 0.20, Ox = 0.72 and Oc = 0.81. These parameters were de- 
termined by a linear least-squares fit on an experimental reference data 
set [30,71] rather than from the adiabatic connection formula. The parame- 
ter ag represents the amount of exact exchange needed for the correction of 
LSD A or GGA near the A = 0 limit. A value of 20% is typical for all hybrid 
schemes as shown by the recent development of nonempirical (no fitting pro- 
cedure on a set of reference data) hybrid functionals [72,73]. B3LYP [74] 
is the most popular hybrid scheme. The difference with the above hybrid 
scheme is the replacement of the PW91 correlation energy functional by the 
LYP correlation functional. The values of the parameters are however the 
same. B3LYP gives better results than B3PW91 and other GGA function- 
als for organic molecules. Results for transition metal compounds and for 
hydrogen bonds, for example, do not appear to represent any improvement 
over GGA functionals. 

Z.6 The Optimized Effective Potential method (OEP) 

The exchange energy is the largest component of exchange-correlation en- 
ergy. In addition to the exact calculation of Tg proposed by Kohn and 
Sham [15], one may think of computing exactly the KS exchange energy 
using. 



^exact 



[p] 




T%{ri)TlArV)Tka{ri)V3'y{^2) 

\ri - T2\ 



dridr2. 



(7.15) 

Then the only approximation in a KS calculation comes from the approxi- 
mate form for the correlation energy functional. One of the major problems 
with this kind of approach, is the exchange-correlation potential needed in a 
KS calculation. It implies the solution of complicated equations. The exact 
derivation of Wxc(?^) leads to the optimized effective potential (OEP) [75,76] 
method. It should be emphasized that, in the OEP method, the energy 
expression, Ex, involves the full non-local exchange but the potential, Uxc is 
local. We recommend the paper of Grabo et al. [77] as a good introduction 
to the OEP formalism. 

The OEP equation must be solved numerically because no simple ana- 
lytical form exists. This explains why its application has been limited to 
atoms. A good approximation is due to Krieger et al. [78] known as the KLI 
approximation. This method suffers from the computational effort which is 
very high compared to GGA and meta-GGA functionals. The application 
of the KLI method is limited to small molecules. Finally, we should mention 
the problem related to finding a suitable correlation functional because the 
cancellation of errors between exchange and correlation contributions which 
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make the success of LDA is absent. However, better total energy, orbital 
energies (see Tabs. 4 and 5), excitation spectra and all the properties that 
depend on the KS orbitals could be obtained with better accuracy than 
with any other type of functional presented here. In contrast to the hybrid 
scheme, the OEP-based exchange potentials are local, which respects the 
pure Kohn-Sham DFT philosophy. 



7. 7 Comparison between various approximate functionals 

In this section, we illustrate the performance of different KS-DFT schemes 
for thermochemistry. The listing is not exhaustive because there are many 
comparisons in the literature and we just want to give an idea of the accuracy 
of the approximations previously presented. 

In Table 6, we compare different exchange-correlation approximations 
for thermochemistry. We should mention that these functionals do not con- 
tain any fitted parameters and the experimental data are corrected for the 
zero point energy. The lack of correlation makes HF useless. LSDA per- 
forms better than HF without increasing the computational time. One can 
see the over binding tendency of this approximation. The GGA functional 
improves quantitatively the accuracy but it is still far from chemical accu- 
racy (± 1 kcal/mol). The inclusion of the Laplacian of the density seems to 
be necessary for a quantitative description. Further improvement is needed. 
We mention that the popular empirical hybrid scheme B3LYP gives a mean 
absolute error (MAE) of 2 kcal/mol on this set of molecules [61]. But remem- 
ber that this functional contains parameters fitted to reproduce atomization 
energies of a set of molecules, including those in Table 6. 



8 LAP correlation functional 

In this section we present the LAP correlation functional family [62-67] 
which is a meta-GGA correlation energy functional developed according to 
the adiabatic connection method, following the recipe given in Section 6. 
First we reformulate the adiabatic connection method as, 

E,[p]= r E^[p]dX, (8.1) 

Jo 



to justify its application to the correlation energy alone. The exchange en- 
ergy functional is taken out of the integral because the exact KS exchange 
energy is defined as the A = 0 limit of the adiabatic connection method 
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Table 6. Atomization energies (in kcal/mol) of small molecules computed with 
various exchange-correlation functionals. (UHF, LSDA and PW91 results are 
taken from Ref. [79]). Experimental and meta-GGA from Ref. [33]). 



Molecule 


A^uhf 


A^lsda 


AAPW91 


^meta— GGA 


A£:expt 


H 2 


84 


113 


105 


115 


109.5 


LiH 


33 


60 


53 


58 


57.8 


CH 4 


328 


462 


421 


421 


419.3 


NH 3 


201 


337 


303 


299 


297.4 


OH 


68 


124 


110 


108 


106.4 


H 2 O 


155 


267 


235 


230 


232.2 


HF 


97 


162 


143 


139 


140.8 


Li 2 


3 


23 


20 


23 


24.4 


LiF 


89 


153 


137 


128 


138.9 


Be 2 


-7 


13 


10 


5 


3.0 


C 2 H 2 


294 


460 


415 


401 


405.4 


C 2 H 4 


428 


633 


573 


562 


562.6 


HCN 


199 


361 


326 


312 


311.9 


CO 


174 


299 


269 


256 


259.3 


N 2 


115 


267 


242 


229 


228.5 


NO 


53 


199 


171 


159 


152.9 


O 2 


33 


175 


143 


131 


120.5 


F 2 


-37 


78 


54 


43 


38.5 


P 2 


36 


142 


120 


118 


117.3 


CI 2 


17 


81 


64 


59 


58.0 


MAE 


71.2 


31.4 


8.0 


3.1 





and therefore, is independent of A. The spin-dependent correlation energy 
functional can be divided as, 

E, [p] = iff [p] + 2S Ti [p] + ^ (8.2) 

where and represent the correlation energy of electrons with parallel 
spin beyond the exchange energy and E^^ represents the correlation energy 
for electrons with opposite spin. This last contribution is bigger than the 
other two and was the only correlation component included in LAPl [62,63]. 
The parallel spin correlation is included in LAP3 [ 66 ] and rl [67] which are 
the recent improvements of the LAPl functional. 

For each different type of contribution to the correlation energy, a spin- 
dependent pair correlation function of Colle and Salvetti [80] is defined and 
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the A-dependence is introduced through the coordinate scaling approach 
[29]. The inhomogeneities are taken into account by local thermodynamic 
analogy [5] using the Laplacian of the density and the kinetic energy density 
defined as, 



^ ucc - 

= ( 8 . 3 ) 

i 

There are (at least...) two points of view on the construction of a func- 
tional. A functional free of parameters that is based on physical arguments 
should give better results in general but can be less accurate for a spe- 
cific set of reference data when compared to a functional fitted to repro- 
duce the results of this reference set. The LAP family of functionals is a 
good compromise between these two approaches. The equations are derived 
based on physical arguments and then a relatively small set of molecules 
(N2, F2, O2, HP, CN, H2O, NH3, CH4, C2H4, C2H6, CgHg, (H20)2) has 
been used to parametrize the correlation energy functional and to synchro- 
nize the LAP correlation energy functional with the exchange functional of 
Becke (B88) [46] and Perdew-Wang86 (PW86) [45]. Such synchronization 
is essential when exchange and correlation energy functionals are developed 
separately. 

The LAP correlation functionals are then tested on a set of 40 small 
and medium sized molecules which represent difficult cases at the LSDA 
and GGA level and for which very accurate experimental atomization en- 
ergies can be extrapolated to 0 K. For difficult cases like hydrogen bonds, 
activation barriers and some transition metal compounds the results are 
comparable to GGSD(T) [81-84]. We should mention that the results for 
transition metal compounds are good even though no transition metals were 
included in the parametrization step, an indication of the good physics in- 
corporated in the functional. 



9 Solving the Kohn-Sham equations 

Ghoosing an exchange-correlation energy functional based on the properties 
of interest is not the only decision to make in a typical KS-DFT calculation. 
In this section, we give an overview of the possible choices based on our 
own KS-DFT program, deMon-KS® [85-87]. These choices, together with 
representative references, are given in Figure 2. Sources of error related to 
these choices are compiled in Table 7. These choices influence directly the 



®The program is available at: http://www.sims.nrc.ca/sims/deMon/ 
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computational time. Informed decisions balancing accuracy and speed have 
to be made for the majority of every-day applications. 




Fig. 2. Schematical representation of the KS choices. 

First of all, we should mention that it is possible to solve the KS equa- 
tion fully by numerical integration. This is how KS-DFT calculation are 
performed within the NUMOL program [120] of Becke. This kind of cal- 
culation does not suffer from basis set incompleteness but is subject to 
numerical noise coming from the numerical integration procedure. How- 
ever, this is the method of choice for the determination of the performance 
of a new exchange-correlation functional because it excludes all the errors 
coming from using a basis set. This way of solving the KS equation is very 
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demanding on computational ressources which limits its application to small 
molecules. 

9.1 The Kohn-Sham orbitals 

The linear combination of atomic orbitals (LCAO) approximation is the 
most used representation of the KS orbitals, in KS-DFT programs. 

This approach is due to Roothaan [117] and was first applied to traditional 
ab initio methods. A good detailed description is given in [4]. In the LCAO 
approach each (r) is expanded in a linear combination of atom-centered 
basis function 



Tiir,cr) = (9.1) 

where are coefficients determined by the SCF procedure. Each atomic 
basis function is also a linear combination of contracted primitives. The 
most popular choices are Gaussians [121], because integrals and derivatives 
are analytical and easy to calculate, and Slater functions [122] which cor- 
respond to the exact solution of hydrogen-like atoms and then are a better 
representation of the radial part of atomic functions. Gaussians® are gener- 
ally prefered over Slater functions because their manipulation is easier even 
though more functions are needed. Using cartesian Gaussian functions, 

g{r, I, a) = (9.2) 

with the normalization factor given by, 

\3/4 

(4a)('^+'«+'^)/2[(2;, - 1)!!(2;^ - 1)!!(2;, - l)!!]-i/", (9.3) 
an atomic basis function can be written as 

iVp 

Xm(^) = X!^w5'p(g^.“p)- (9.4) 

p 

The contraction coefficients, d^p, and the exponents, Op, are optimized sep- 
arately only once and are kept fixed for all successive calculations. The 
optimization procedure of atomic basis sets using Gaussian functions is de- 
scribed in [123]. The linear combinations in equations (9.1) and (9.4) are 
not infinite which introduces an error known as the basis set incompleteness. 



® Different basis sets using Gausssian functions are available at 
http://www.emsl.pnl.gov : 2080/forms/basisf orm.html 
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Table 7. Sources of error in KS-DFT calculations. 



Sources of error 

Exchange-correlation functional 
Orbital basis set incompleteness 
Auxiliary basis set incompleteness 
Basis set superposition error 
Fitting procedure 
Numerical integration 
Molecular grid 
Relativity 

SCF convergence criteria 
Geometry convergence criteria 
Solvation 
Finite system 



Introducing the LCAO approximation (9.1) into the KS equations (2.24), 
multiplying to the left by an arbitrary basis function, integrating 

over the whole space, we obtain a matrix equation, 

= SC£, (9.5) 

where the matrix elements are given by 

= j (9.6) 

= J Xi^{r)xi_c{r)df, (9.7) 

which constitute a linear system of equations to be solved by standard 
techniques. Efficient algorithm design for the integration of (9.6) and (9.7) 
is presented in [88,89]. The time required to solve (9.5) is directly related to 
the total number of atomic functions^®. This number can be reduced simply 
by using an atomic basis set optimized for DFT calculations (such as the 
one presented in [118,119]) instead of a big basis set needed for post-HF 
methods, where higher flexibility is needed to get as much as possible of the 



^'^The diagonalization of the KS matrix scales as O(N^), where N is the number 
of atomic basis function. This step of the SCF procedure can be reduced to 0{N) for 
large system using special numerical techniques because in this case, the KS matrix is 
sparse [124]. 
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total electronic correlation. Moreover, using an atomic basis set for DFT 
reduces the basis set superposition error (BSSE). The choice of a basis set 
is as important as the choice of an exchange-correlation functional. 

9.2 Coulomb potential 

Using the following definition for the electronic density in the LCAO ap- 
proach, 



N 

(9-8) 

the coulomb potential can be evaluated exactly [88, 89] using 




Consequently, the evaluation of the coulomb potential depends on the square 
of the total number of basis functions {N'^) and its contribution to the 
evaluation of grows as N'^. To reduce the computational time, one 
may approximate the electronic density by a fit on an auxiliary atomic 
centered basis set [90,91], 



Nk 

p{r) p{f) = ^akfk{r), (9.11) 

k 

where the ak are fitting coefficients and fk{^ are fitting functions 
(Gaussian, Slater, ...). The tilde indicates a fitted function. An optimiza- 
tion procedure similar to the orbital atomic basis set is employed to get the 
exponent of the primitive functions, /fe(tO- The fitted density is equal to 
the exact density in the limit of a complete auxiliary basis set which is not 
the case in practice. The coulomb potential with the fitted density becomes 



Vein) 




(9.12) 



With the fitted density, the cost of the evaluation the coulomb energy needed 
in is reduced to N'^Nk- Since the number of fitting functions is smaller 
than the number of basis functions, this procedure saves a lot of CPU time. 
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One has to use the fitted density to get the scaling attributed to DFT 
in the literature; otherwise it is formally as for HF. 

One can go one step further, knowing that electron-electron coulomb 
repulsion is a short range interaction, by using the fast multipole method 
[92]. This approach is used in almost all order- fV methods. 

9.3 Exchange-correlation potential 

The exchange-correlation potential needed for the solution of the KS 
equation (2.24) is a much more complex mathematical function than the def- 
inition of the exchange-correlation energy functional presented in 
Section 7. For that reason, the exchange-correlation potential and energy 
are calculated numerically on a grid^^ and constitute the bottleneck in the 
solution of the KS equations. As already mentioned, the results can be 
influenced by numerical noise if a grid with not enough points is used. To 
reduce the computational time during the SCF procedure, one can calculate 
the exchange-correlation potential on a reduced grid and then use a fitting 
procedure [90], 



Nk 

<c(^ « «xc(^ = (9-13) 

k 

where bk are fitting coefficients determined by least squares fit and 
are fitting functions. Once again, error is introduced because the new aux- 
iliary basis set in not complete. This procedure is accurate only if one or 
several extra iterations are performed on an augmented grid when the SCF 
calculation is converged. (This is how the exchange energy contribution 
is calculated in deMon-KS). This procedure is still time consuming even 
though the fitting procedure requires a smaller grid than a full numerical 
intregration. To further decrease the computational time, one can avoid the 
numerical evaluation of the exchange-correlation potential and energy on a 
grid using a grid-free technique [113-116]. 

9.4 Core potential 

Core electrons play a limited role in the chemistry of molecular systems 
because they are tightly bound to the nucleus. For that reason, it is attrac- 
tive to separate an atom into core and valence regions. Then, the effects 
of the core electrons are simulated by an analytical one-electron potential. 



^^Explanation of the details of the three dimensional numerical integration in molecular 
systems is beyond the scope of this chapter. The reader is refered to recent contributions 
[109-112] for technical details. 
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Ccore(lO) a truncated basis set is used to treat the valence electrons 
explicitly. The smaller size of the basis set compared to an all-electron cal- 
culation leads to an appreciable computational time saving. Moreover, they 
can also include relativistic effects through an approximate treatment of 
mass-velocity and Darwin terms during the optimization procedure. This 
effect is very important for a good description of compounds containing 
second and third row transition metals. The different methodologies can 
be divided into three categories, pseudo-potential (PP) [94-96], model core 
potential (MCP) [98] and effective core potential (ECP) [101-104,106-108]. 
This last type of core potential is usually optimized for ab initio techniques. 
We recommend the review articles of Frenking et al. [125] and Cundari 
et al. [126] for more details. 

9.5 Other choices and sources of error 

As mentioned in Section 2.3, KS equations have to be solved self-consisten- 
tly. The iterative process is stopped when the change in the total energy and 
in the density is less than a threshold supplied by the user. Typical values 
are 1 x 10“® and 1 x 10“^ a.u. for the energy and the density respectively. 
There is also a value to give for the determination of the convergence during 
a geometry optimization. Depending on the program, there is between two 
to four quantities to check for convergence. For example, the maximum 
atomic contribution to the gradient, the root mean square (RMS) of the 
gradient, the maximum displacement, and the RMS of the displacement. In 
general, all the convergence criteria mentioned so far possess default values 
in DFT programs. These values are not necessarily adapted for your specific 
problem and must be checked. 

Beside technical errors, errors can arise from the choice of the model. 
For example, using an over simplified model for a biological system can re- 
sult in a bad description of the cooperative effects [127]. Solvation is also 
critical in this kind of system. Finite size systems can also be a source of er- 
ror in particular situations. For example, a cluster model is not necessarily 
the best model to study the chemistry that occurs on a surface since its be- 
haviour is not the same as the bulk when small sized clusters are considered. 
Attention has to be paid when selecting a model. 

9.6 Functionality 

Finally, we conclude this section by giving a list of functionalities available 
in different DFT programs with some representative references: 



• geometry optimization (minima, transition states and saddle points) 
[128]; 
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• ab initio molecular dynamics simulation [129,130]; 

• molecular vibrations, Infrared and Raman intensities [131]; 

• charge analysis [132,133]; 

• dipole moments, polarisability and hyperpolarisability [134-136]; 

• molecular electrostatic potential and its critical points [137]; 

• electronic excitations [136,138]; 

• NMR and EPR spectra [139-143]; 



• ionization potential, electronic affinity, atomization and binding 
energy; 



• reaction path following (IRC) [144]; 



• X-ray absorption [145] and emission [146] spectra; 



• vibrational Circular Dichroism [74,147]; 



• solvent effects [142,148]. 



10 Applications 

Here we present just a few examples to indicate some areas where DFT has 
been used to solve real chemical problems and to provide valid insight into 
properties and processes. The literature is rich, and the reader should have 
no difficulty finding other exciting paths to follow. 
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10.1 Ab initio molecular dynamics for an alanine dipeptide model 

DFT and ab initio calculations are usually performed in a static mode. That 
is, they yield information on extrema, like minima and transition states, but 
not on properties that depend on the dynamics of the system. To get such 
information, a common approach is to use classical molecular dynamics 
(CMD) simulations by solving Newton’s equations of motion. 



d "^Xi dE 

dt dxi ’ 



( 10 . 1 ) 



using a classical force field to get the forces {—dE/dxi) felt by an atom 
(where is the atomic mass of atom i and Xi is one of its 3 carte- 
sian coordinates). The ability of this kind of calculation to reproduce 
all the interactions present in the system under consideration, is related 
to the quality of a given force field, which depends on the parametriza- 
tion and the nature of the terms included in it. Recent work in several 
laboratories is revealing the importance of polarization terms. 

In an AIMD simulation, a DFT or an ab initio electronic calculation is 
performed at each time step and the forces between atoms are obtained from 
the first derivatives of the total energy with respect to the nuclear positions. 
This approach has been introduced by Car and Parrinello [129]. With the 
use of DFT it is possible to compute all the interactions from first principles. 
The description of hydrogen bonds is now under reasonable control with the 
new generation of functionals [83] but further improvements are needed for 
van der Waals interactions which are largely dominated by dispersion. 

Recently, Wei et al. [149] have performed an AIMD simulation on (S)- 
Qf-(formylamino)propamide, an alanine dipeptide analogue, using DFT. A 
scan of the potential energy surface reveals two low energy conformers, C7eq 
and C5, (represented in Fig. 3) characterized by two dihedral angles, (j) and 

defined by C3-C2-N1-C1 and N7-C3-C2-N1 respectively. The value of the 
angles {(j>, tp) is (—80,80) and (—160, 160) for C7eq and C5 respectively. 

Using the combination B88 [46] for exchange and P86 [47,48] for corre- 
lation, the C7eq conformer is lower in energy by 1.7 kcal/mol. The result of 
the AIMD simulation is presented in Figure 4. 

A hydrogen bond is formed each time the distance between Oi • • • Hy 
(solid line) or O 2 • • • H 2 (dotted line) is around 2 A. From Figure 4, we 
can see that both conformer regions are frequently visited at room tem- 
perature and the transformation takes about 3 ps to be completed. In 
contrast, no transition from C7eq to C5 conformer is observed for a CMD 
simulation lasting many nanoseconds, using popular classical force fields, at 
298.15 K. But, at 375 K a CMD simulation using CHARMM [150], showed 
a conformational transformation between C7eq and C5. This difference is 
not coming from a bigger activation barrier since the value obtained with 
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C7eq C5 



Fig. 3. 2D schematical representation of C7eq and C5 isomers of (S)-a- 
(formylamino)propanamide. 




Fig. 4. Time evolution of the Oi • • ■ H 7 (solid line) and O 2 • • • H 2 (dotted line) 
hydrogen bonds at 298.15 K. 



CHARMM force field, 2.4 kcal/mol, is similar to the value of 2.5 kcal/mol 
obtained with DFT. Then, as propose by Wei et al. [149], force fields 
are probably not appropriate for gas phase transformations. This is not 
surprising because usually, force fields are parametrized from experiment, 
which include solvent effect and as a consequence, there is less flexibility 
in the whole molecule coming from constraints of the dihedral angles. This 
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example showed the importance of treating properly the electronic correla- 
tion and charge polarization in biological molecules. There is a great need 
for force fields that incorporate this added complexity. 

10.2 Transition metal clusters: The ecstasy, and the agony... 

The next two examples concern small transition metal clusters. Clusters 
with less than 20 atoms do not behave like the bulk. They are character- 
ized by shorter bond length and higher magnetic moment per atom when 
compared to the bulk values and there are many low-lying excited states. 
For these reasons, the accuracy of the results may be highly influenced by 
the technical details. For example, many tests have to be performed in or- 
der to determine a good basis set and a grid that contains enough points 
for the numerical integration of the exchange-correlation potential. 

10.2.1 Vanadium trimer 

It is impossible to determine the ground state structure of V3 from exper- 
iment [151] or from calculation [152-156] alone. This is generally the case 
for transition metal clusters in the gas phase except for a few dimers. For 
a review of gas phase experiments on small clusters in the gas phase we 
recommend the article of Simard et al. [157]. 

The DFT results for V3 and obtained in Ref. [156] are presented 
in Table 8. There are two states very close in competition for the ground 
state, a ^ state with an equilateral shape and a ^^2 state which is an acute 
triangle. The energy difference, 0.03 eV, is clearly within typical error bars 
for DFT (GGA) applications to this type of system (errors around 0.5 eV for 
dissociation energies are not uncommon and, while some cancellation should 
occur when comparing different molecular states, it is probably prudent to 
keep 0.5 eV (~10 kcal/mol) as a rough margin of uncertainty). The bond 
energy computed from both states are very close to the experimental value 
of 1.42 ± 0.10 eV [158]. Moreover, the authors point out the importance 
of the basis set to get the correct energy ordering of the two states of V3. 
They showed that a basis set optimized for LSDA calculations [118] was not 
good enough, preparing a GGA-optimized one proved to be worthwhile. 

The situation is also ambiguous for . A ^A '2 state could be chosen as 
the ground state based on the energy difference but a state based on 
the bond energy. The experimental bond energy of is 2.27 ± 0.09 eV. 

Recently, the comparison between the experimental PFI-ZEKE spectra 
and the simulated ones from KS-DFT calculations, has been successfully 
applied to determine the ground state structure of NbsO, NhsO^, Nb3G2 
and Nb3G2“'' [159,160]. Galaminici et al. [156] have applied the same pro- 
cedure to V3 and V^. The experimental [151] and the simulated spectra for 
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Table 8 . KS-DFT results for V 3 and from [156]. 





V 3 




^ 3 " 






M 2 


M' 


Mi 


Relative energy 


0.00 


0.03 


5.61 


5.79 


Bond energy 


1.59 


1.60 


2.10 


2.27 


d(Vi - V 2 ) 


2.169 


2.335 


2.180 


2.211 


0{ViV2V3) 


60.0 


49.7 


60.0 


60.0 



V 3 (^A[) —>■ (^^ 2 ) ^ (^^ 2 ) ^ (^^ 2 ) presented in Figure 5. 

The very good agreement between the experimental and the first simulated 
spectra allowed Calaminici et al. to determine the ground state of V 3 and 
unambiguously as ^A[ and ^ A!r^ respectively. Clearly, modern DFT can 
be a powerful partner for modern experiment. 

10.2.2 Nickel clusters 

A systematic study on transition metal clusters implies the calculation of 
many structures without symmetry constraints, in order to allow the cluster 
to relax, for example, according to the Jahn-Teller theorem. Each of the 
starting geometries must be fully optimized for many multiplicities and 
all the different optimized structures must be characterized by vibrational 
analysis to determine the nature of the stationary point found. These are 
difficult systems, requiring close attention to technical details. The lack of 
detailed computational information or protocol [161] followed makes it hard 
to assign the quality of some of the published computational studies, for 
example, on M 4 and M 5 . 

The lowest energy structure of Ni 4 obtained with at least four differ- 
ent DFT programs, various exchange-correlation functionals, basis sets and 
grids, are presented in Table 9. This Table illustrates the need to carefully 
choose the computational details for the characterization of transition metal 
clusters. 

The ground state structure of Ni 4 is unknown experimentally. A dis- 
torted tetrahedron (D 2 d) is obtained as the lowest energy structure using 
different computational choices [162-167]. Reddy et al. [168] obtained a 
perfect tetrahedral structure (Td) probably because they used the fractional 
occupation of molecular orbitals to help the SCF convergence. The use of 
this procedure tends to favor highly symmetric structures when it is not 
removed completely at the end of the calculation. The absence of vibra- 
tional frequencies make this result questionable. Other authors founded an 
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Frequency (cm‘^) 

Fig. 5. Experimental (top panel) and simulated PFI-ZEKE spectra of V 3 at 298.15 
K. Middle panel: V^{^A' 2 ), lower panel: Vz[^A'-^) — > V^(^A' 2 ). 



imaginary frequency for that structure [164,165] which indicates a Jahn- 
Teller active structure. From [164] the relaxed structure, a distorted tetra- 
hedron, is lower in energy by only 0.025 eV at the GGA level. 

It is hard to comment on the details of the geometry of Table 9 because 
different functionals, basis sets and grids have been used. Gomparing the 
LSDA results obtained with deMon-KS and DGAUSS using the same basis 
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set, one may think that the difference is coming from the small grid used 
by Castro et al. [164], 832 points per atom compare to at least 7760 points 
by Cisneros et al. [166]. We have repeated the calculation of Castro et al. 
with the same KS-DFT program, basis set and functional but we used more 
stringent convergence criteria and we increased the total number of points 
per atom to 20 952. The results are the same at the LSD A level but with the 
GCA PW86P86 functional we get slightly different results; a = 2.23 A and 
b = 2.39 A closer to the DGAUSS results. The remaining differences could 
be due to the functional used or to the small differences in the computational 
details. At the GCA level, calculations using an ECP agree well with each 
other but they seem to favor a larger value of the parameter “a” when 
compared to all electron calculations. 

Concerning the electronic state, all the computational studies obtained a 
cluster with 4 unpaired electrons (multiplicity of 5) [163-168] except Reuse 
and Khanna [162] who obtained 6. In [163, 168] it is mentioned that Reuse 
and Khanna have done again the calculations using non relativistic all elec- 
tron basis set and their results are in agreement with all other studies. At 
the present time it is not clear if the discrepancy is coming from the pres- 
ence of relativistic effects or from the lack of core-valence polarization in 
the pseudo-potential. In principle, the issue could be resolved by a con- 
verged all-electron basis set calculations, using a relativistic treatment but 
that is beyond the state of the art for systems like Ni4. The ground state 
of Ni4 cannot be assigned with certainty as a distorted tetrahedron with 4 
unpaired electrons because there are a lot of low-lying states. For example, 
Michelini, Diez and Jubert found three isomers within 0.1 eV above the 
lowest state. As already mentioned for V3, an energy difference less than 
0.5 eV leaves uncertainty in the identity of the ground state. 

All DFT studies on M5 [162-164, 168] agree on a trigonal bipyramid 
as the lowest energy structure (see Tab. 10). In three of the studies 
[163,164,168], the lowest energy structure possesses 4 unpaired electrons 
giving rise to a multiplicity of 5. As already mentioned just above. Reuse 
and Khanna have re-investigated their calculations performed in [162] and 
they have found also 4 unpaired electrons instead of the 8 obtained previ- 
ously. Recently, Aspel et al. [173] have measured the magnetic moment per 
atom in nickel clusters containing 5 to 740 atoms. In the case of Nis they 
obtained 1.6 /xb per atom corresponding to 8 unpaired electrons. Many 
potential explanations have been proposed beyond experimental error and 
errors coming from the calculations. It is possible that the experiment does 
not detect the ground state but, rather, a higher energy structure. This is 
possible because we do not know the temperature exactly in this kind of 
experiment. During the school, Gustavo Pastor pointed out the possibil- 
ity of a significant contribution to the magnetism coming from the orbital 
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Table 9. Comparison of the lowest energy structure of Ni 4 obtained from different 
programs, basis sets and functionals. 



Program 


XC func. 


Sym. 


R,{k) 


Ns 


Ref. 




LSDA“ [37] 


D2d 


a = 2.12; b = 2.74 


6 


[162] 


ADF [169] 


LSDA^ [36] 


D2d 


a = 2.19; 6 = 2.31 


4 


[165] 


deMon-KS [85-87] 


LSDA“ [36] 


D2d 


a = 2.15; 6 = 2.28 


4 


[164] 


DGAUSS [170] 


LSDA“ [36] 


D2d 


a = 2.18; 6 = 2.26 


4 


[166] 


deMon-KS [85-87] 


PW86P86“ 


D2d 


0 = 2.21; 6 = 2.41 


4 


[164] 


DGAUSS [170] 


B88P86“ 


D2d 


0 = 2.24; 6 = 2.35 


4 


[166] 


G94 [171] 


B88PW91‘^ 


D2d 


0 = 2.28; 6 = 2.38 


4 


[163] 




BLYP® 


D2d 


0 = 2.26; 6 = 2.40 


4 


[167] 


DMOL [172] 


BLYP'' 


Td 


2.24 


4 


[168] 



‘‘Norm-conserving pseudo-potential from [94]. 
^‘Frozen core approximation used up to 3p. 
“All electron basis set from [118]. 

‘^Effective core potential. 

“Effective core potential from [100]. 
^Numerical basis set. 



Table 10. Comparison of the lowest energy structure of Nis obtained from dif- 
ferent programs, basis sets and functionals. 



Program 


XC func. 


Sym. 


i?e(A) 


Ns 


Ref. 




LSDA“ [37] 


Dsh 


o = 2.23; 6 = 2.29 


8 


[162] 


deMon-KS [85-87] 


PW86P86'" 


Dsh 


0 = 2.31; 6 = 2.36 


4 


[164] 


G94 [171] 


B88PW9U 


Dsh 


0 = 2.30; 6 = 2.38 


4 


[163] 


DMOL [172] 


BLYP'^ 


Dsh 


0 = 2.24; 6 = 2.29 


4 


[168] 



“Norm-conserving pseudo-potential from [94]. 
*‘A11 electron basis set from [118]. 

“Effective core potential from [100]. 
‘^Numerical basis set. 



angular momemtum in free clusters. At the present time this hypothesis 
needs further investigation and we refer the reader to the chapter of Pastor, 
in the present book. 
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Clearly, a systematic study is needed to investigate the effect of ECP 
vs. all electron basis set, the effect of various gradient correction schemes, 
the effect of a given grid type and size, convergence criteria, etc. on the 
geometrical description of Ni 4 and Nis. The various DFT programs have 
to be compared and the “numerics” must eventually converge to the same 
results. This is essential to assess the accuracy of DFT for the description of 
the difficult case of transition metal clusters and to propose a computational 
protocol. 

10.3 The conversion of acetylene to benzene on Fe clusters 

In the last example, KS-DFT is used to study a chemical reaction involving 
a transition metal cluster in the gas phase. The absence of solvent in a gas 
phase experiment allows a better comparison with theoretical studies which 
are usually performed in a reference system corresponding to the vacuum 
at 0 K. 

The formation of benzene on naked transition metal clusters in the gas 
phase has been observed from acetylene on Fe"*" [174] and from ethylene 
on Fe^ [175-178], W"*" [179] and IT*" [180]. The reaction mechanism is 
unknown but reaction intermediates have been observed. The formation 
of benzene from ethylene implies the dehydrogenation of ethylene to form 
benzene. We think that a detailed study of the formation of benzene from 
acetylene on iron clusters, Fe+ (n = 1 — 4), can served as a good model for 
ethylene and other transition metal clusters in the gas phase. To do so, we 
have studied the reaction mechanism presented in Figure 6. Results on the 
cyclodimerization of two acetylene units (step D in Fig. 6) will be discussed 
in more detail. 

All-electron calculations were performed with deMon-KS [85-87] which 
uses the Linear Combination of Gaussian Type Orbitals-Kohn-Sham- 
Density Functional method (LCGTO-KS-DFT). We used Perdew and Wang 
for exchange [45], and Perdew for correlation [47,48], denoted PW86P86. 
Moreover, we have also included the results obtained with the LAPS corre- 
lation functional [66] synchronized with the popular GGA exchange func- 
tionals of Becke [46] (BLAP3) and Perdew and Wang for the exchange [45] 
(PLAP3). The orbital basis sets were (63321/5211*/311-k), (7111/411/1*) 
and (41/1*) respectively for iron, carbon and hydrogen atoms. The basis 
sets were optimized as described in [118]. The density and the exchange- 
correlation potential were fitted using the auxiliary basis set; (5,5; 5,5) 
for iron, (4,3; 4,3) for carbon, and (3,1; 3,1) for hydrogen. Following the 
notation (^ 1 ,^ 2 ; h,h), ki (h) is the number of s-type Gaussians in the 
density (exchange-correlation) basis and ^2 (^ 2 ) is the number of s-, p-, 
and d-type Gaussians constrained to have the same exponent in the den- 
sity (exchange-correlation) basis. During the SCF calculation, the density 
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Fig. 6. Schematic representation of the mechanism studied for the formation of 
benzene from acetylene on Fe"F. 



was fitted analytically while the exchange-correlation potential was fitted 
numerically on a grid containing at least 1066 points per atom. At the 
SCF convergence, the exchange-correlation potential is evaluated on a grid 
containing more than 4000 points per atom. This is done to evaluate 
the final exchange-correlation energy and the contribution of the 
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exchange-correlation energy to the gradient accurately. In the angular grid 
each shell of angular points is rotated randomly. This procedure destroys 
slightly the symmetry of the molecule. In our cases, the difference is a few 
thousandths of an Angstrom for interatomic distances and 1 degree for the 
angle, which are acceptable values. We choose this option because, some- 
times, it facilitates the SCF convergence. 

In order to obtain some minima on the potential energy surface, each 
geometry of Figure 6 (and many others) has been fully optimized, with- 
out symmetry and geometric constraints, for different multiplicities (differ- 
ent number of unpaired electrons) using the Broyden-Fletcher-Goldfarb- 
Shanno algorithm [128]. During the optimization, the norm of the gradient 
and the maximum atomic contribution to the norm of the gradient was mini- 
mized below 10“^. The convergence criteria for the density was fixed to 10“® 
while it was 10“^ for the energy. Vibrational analysis was performed, in 
the harmonic approximation, for all the optimized structures. If one or sev- 
eral imaginary frequencies were detected, the structure was perturbed along 
each mode and re-optimized. The second derivatives were calculated by fi- 
nite differences of the analytical gradient using a displacement of 0.03 a.u. 
Transition states have been located with GAUSSIAN94 [171] using the com- 
bination of B88 [46] for exchange and P86 for correlation [47,48] (B88P86). 
We should mention that the density and the exchange-correlation potential 
are not fitted in GAUSSIAN94. This could introduce very small difference 
between the results obtained by both programs. 

Reaction profiles for the cyclodimerization of two acetylenes on an iron 
cation computed with the B88P86 functional are presented in Figure 7. 
Gorresponding values for PW86P86, PLAP3 and BLAP3 are collected in 
Table 11. Ns corresponds to the number of unpaired electrons. The lowest 
energy structure for the adsorption of a single acetylene possesses three un- 
paired electrons. The addition of a second acetylene will bring us on the path 
represented by a dotted line (Ng = 3). The energy gain coming from the 
addition of a second acetylene is larger than any of the activation energies 
involved in this step of the reaction. Gonsequently, Fe(G 2 H 2 ) 2 "'’ possesses 
enough internal energy to cross the barriers. Supposing that we will stay on 
this path until the addition of a third acetylene (step G and F in Fig. 6), one 
can clearly see from Figure 7 why the cyclodimerization of two acetylenes 
does not seem to be important experimentally [174], the energy increases 
when going from Fe(G 2 H 2 ) 2 ~'" to any of the cyclic forms, Fe(n-G 4 H 4 )+ or 
Fe(c-G 4 H 4 )+. It is possible to go on the second path {Ng = 1), and observe 
cyclodimerization of acetylenes, only if spin-orbit coupling is important or 
if the first acetylene molecule binds to a highly excited iron cation. Since 
the formation of a cycle, Fe(n-G 4 H 4 )"'' or Fe(c-G 4 H 4 )“*', was not observed 
quantitatively in the experiment, we think that both situations are not 
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Relative energy (kj/mol) 




Fig. 7. Relative energy (in kJ/mol) for the cyclodimerization of acetylene on Fe”*" 
obtained with B88P86 exchange-correlation functional. The solid line and the dot 
line correspond to Ns = 1 and Ns = 3 respectively. 

frequently met. All the attempts to find a transition state to connect di- 
rectly the Fe(C 2 H 2 )J and Fe(c-C 4 H 4 )+ complexes have failed so far or end 
on TSl or TS2. For the moment, we do not rule out this path but we 
get the same problem when we tried to find a transition state between the 
Fe(C 2 H 2)3 and Fe(C 2 H 2 )(c-C 4 H 4 )+ complexes. 

Comparing different exchange-correlation functionals (Tab. 11), the cy- 
clodimerization of acetylene seems to be possible with FLAPS and BLAP3. 
In contrast to PW86P86, BLAP3 and B88P86 where Fe(c-C 4 H 4 )+ with 
A"s = 1 is the most stable structure by many kJ/mol, PL APS gives four 
nearly degenerate structures. We notice a variation of 50 kJ/mol at most 
in the relative energies for a given complex and a given value of Ng which 
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Table 11. Relative and activation energies (in kj/mol) for the cyclodimerization 
of acetylene on Fe^ computed with various functionals. 





Ns 


PW86P86 


PLAP3 


BLAP3 


B88P86 




1 


46.9 


65.1 


73.1 


59.9 


Fe(G2H2)2+ 


3 


0.0 


0.0 


0.0 


0.0 




5 


168.0 




164.9 


169.7 


TSl 


1 


113.2“ 


146.3“ 


128.0“ 


114.5 




3 


132.0 


163.1 


140.1 


114.8 




1 


0.2 


-5.9 


-7.9 


21.4 


Fe(n-G4H4)+ 


3 


2.7 


-3.6 


-5.6 


8.0 




5 


73.8 


65.4 


59.4 


75.9 


TS2 


1 


125.5 


165.4 


152.6 


126.1 




3 


130.6“ 


148.5“ 


141.9“ 


127.4 




1 


-37.1 


-4.1 


-28.3 


-26.0 


Fe(c-G4H4) + 


3 


37.6 


59.2 


35.7 


33.4 




5 


109.5 


109.3 


93.2 


137.1 



“Energy calculated at the B88P86 optimized structure. 



corresponds to the 0.5 eV for the margin of uncertainty introduced in the 
previous section for GGA functionals. The correct energetic description of 
these complexes may require multiconfigurational treatment which is in- 
cluded formally in KS-DFT method through the exact density but is not 
the case for approximate exchange-correlation functionals. We are currently 
performing calculations using the Goupled Gluster Singles, Doubles and 
Triples^^ (GGSD-T) method on that step of the reaction to better assess 
the energetics of the reaction. We should mention that some of the relative 
energies of the transition states were obtained by an evalutation of total 
energy at the B88P86 optimized structure. The difference between these 
values and the ones obtained from a complete optimization was only a few 
kJ /mol for the other ones where we have optimized the structure, thus this 
difference is negligible compared to the 50 kJ /mol of uncertainty. 



^^The triple excitations are included non-iteratively. 
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Table 12. Optimized structure (distances in Angstroms and angles in degrees) 
and freqnencies (in cm“^) of the TS corresponding to the cyclodimerization of 
acetylene on Fe^, TSl, computed with various functionals (As = 3). 





PW86P86 


PLAP3 


BLAP3 


B88P86 


r(PeCi) 


1.925 


1.928 


1.924 


1.925 


r(CiC2) 


1.303 


1.294 


1.288 


1.303 


r(C2C3) 


1.901 


1.854 


1.854 


1.822 


r(CiHi) 


1.090 


1.073 


1.072 


1.089 


r(C2H2) 


1.090 


1.074 


1.073 


1.092 


0(PeCiC2) 


76.0 


77.4 


77.2 


77.0 


0(C2PeC3) 


55.1 


52.8 


53.6 


52.6 


0 (HiCiC 2) 


140.4 


139.6 


139.9 


139.0 


0(01020304) 


-75.0 


-67.7 


-70.0 


-66.1 


V 


-365 


-383 


-344 


-324 



The differences between the optimized structure for the minima com- 
puted with various functionals are less than 0.01 Angstrom for bond lengths 
and 1 degree for angles. These values should be taken as error bars in our 
calculations. The optimized structure of TSl (Ag = 3) is presented in 
Table 12. The difference in the bond lengths calculated with various func- 
tionals is less than 0.01 Angstrom except for the bond that is formed during 
the cyclodimerization process (C2-C3). The dihedral angle corresponding 
to the departure from planarity of the carbon skeleton is also very sensi- 
tive to the functional. There is a difference of 60 cm“^ in the frequency 
corresponding to the transition state mode. This value is bigger than the 
differences observed for the minima and for TS2 (Ag = 1). When com- 
paring PW86P86 with B88P86 and PLAP3 with BLAP3 (z.e. changing 
the exchange functional) we see that the exchange energy functional is re- 
sponsible for 2/3 of the difference and the remaining difference is due to 
the correlation functional. Clearly, further work is needed before DPT can 
claim universal quantitative accuracy for transition-metal reactions. 

11 Conclusions 

The creation of very accurate exchange-correlation energy functionals and 
efficient implementations makes DPT a very useful computational tool for 
chemical applications. But, there is still room for improvement. Chemical 
accuracy, errors less than 1 kcal/mol, is still a challenge and there is an 
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increasing need for correlation energy functionals suitable to use in combi- 
nation with exact KS exchange. The meta-GGA seems to be a promising 
alternative to the hybrid schemes that require more computational effort. 
The example of V3 showed that a proper combination of experiment and 
theory can somethimes be the key to success. Nickel clusters were used to 
illustrate the need to carefully choose the computational details. KS-DFT 
can be used for the determination of reaction mechanisms but more work is 
needed for the determination of relative energies. 
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M. Brack 



Abstract 

We review semiclassical methods of determining both average trends 
and quantum shell effects in the properties of finite fermion systems. 
I. Extended Thomas-Fermi model (ETF): the average, 

selfconsistent mean field can be determined by density variational 
calculations using the semiclassical gradient-expanded ETF density 
functional for the kinetic energy. From this, average ground-state 
properties such as binding energies, densities, separation energies, 
etc. can be derived. II. Periodic orbit theory (POT): quantum oscil- 
lations in a mean-field system can be obtained from the semiclassi- 
cal trace formula that expresses the quantum-mechanical density of 
states in terms of the periodic orbits of the corresponding classical 
system. Only the shortest periodic orbits with highest degeneracy 
are important for the coarse-grained level density, i.e., for the gross 
shell effects. Particular uniform approximations are required to treat 
systems with mixed classical dynamics due to the effects of symme- 
try breaking and orbit bifurcations. III. Local-current approximation 
(LCA): the collective dynamics of the fermions can be described in 
linear- response theory, approximating the particle-hole excitation op- 
erators semiclassically by local current distributions. The method is 
suitable in combination with both the ETF density variational ap- 
proach or with the Kohn-Sham density functional approach for the 
ground state, and allows one to describe optic response properties 
such as static polarizabilities and plasmon resonances. Applications 
of all methods to metal clusters and various mesoscopic nanostruc- 
tures are given. 
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1 Introduction 

We want to describe properties of finite fermion systems semiclassically. We 
start from a mean- field approach where the particles move independently 
in a selfconsistent potential. There are essentially two standard methods to 
derive such a selfconsistent potential (see, e.g. [1]): i) the Hartree-Fock (HF) 
theory in which the exchange is treated exactly, and ii) the Kohn-Sham 
(KS) density functional theory (DFT) in which the exchange often is treated 
only in the local density approximation (LDA), but other correlations can 
also be included. In the following, we write the mean-field Hamiltonian as 
the sum of the kinetic energy operator and the selfconsistent mean field V 
(that includes possible external fields) 

H = f + V. (1.1) 

The single-particle energies Si and wavefunctions ipi are given by the (HF 
or KS) Schrodinger equation 

H(fi^{r) = Eiipiir) . ( 1 . 2 ) 

In HF theory, V contains also the non-local Fock exchange term, whereas in 
KS-DFT theory V = K(r) is a local potential (the total KS potential, which 
includes also an exchange-correlation part). For simplicity we assume that 
the spectrum £j is discrete. 

The fundamental quantity that contains all the quantum information of 
the energy spectrum is the single-particle level density 

g(E) = tr 6{E - H) = ^ (z|<5(A - H)\i) = ^ 

i i 

where 6{x) is the Dirac delta function. The summation over i here goes 
over the complete spectrum including all degeneracies. 

Following very general arguments of Strutinsky [2] , the level density may 
be split into an average part g{E) and an oscillating part Sg{E): 

g{E)=g{E) + Sg{E). (1.4) 

Although the two parts arise naturally within the periodic orbit theory 
(POT), as will be discussed in Section 3, it is customary to consider them 
separately. The average part g may be calculated numerically from the 
by the Strutinsky averaging method [2]. It is equivalent [3,4] to its semiclas- 
sical approximation obtained in the extended Thomas-Fermi (ETF) model 
which will be discussed in Section 2. The oscillating part Sg, which con- 
tains all the quantum shell effects coming from the non-uniform distribu- 
tion of the single-particle levels Si (including their degeneracies), is obtained 
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semiclassically in the POT discussed explicitly and illustrated in Section 3. 
A semiclassical approach to the collective excitation spectrum, using a local- 
current approximation (LCA), is finally presented in Section 4. 

2 Extended Thomas— Fermi model for average properties 

The ETF model allows one to obtain average properties such as binding and 
deformation energies, separation energies, etc., that do not include quantum 
oscillations. The main idea of the TF model is to take an average over the 
classical phase space with a unit occupation probability per unit cell h^, 
where h is Planck’s constant (we assume in the following a 3-dimensional 
system), multiplied by a factor 2 to take the spin degeneracy into account. 
Including finite 7i-correction terms yields the extended TF model that con- 
tains average quantum corrections without, however, generally allowing for 
quantum shell effects. 

2.1 Thomas-Fermi approximation 

The TF approximation for the level density is obtained by taking the trace 
in (1.3) in the phase space: 

gTF(E) = (5[F- i/ci(r,p)]d^rd3p, (2.1) 

where we have replaced H by the classical Hamiltonian Hc\ 

Hc\{r,p) = + V{r). (2.2) 

2m 

If the mean field in (1.1) contains non-local terms {e.g., the Fock term in the 
case of HF theory), then the potential will also be momentum dependent, 
V = P(r,p), and can be obtained from (1.1) by a Wigner transform [1]. In 
the following we assume V (r) to be local and analytical. In the ground state 
of a mean-field system, all levels up to the Fermi energy E-p are occupied. 
The total number of particles N is then given by the integral 

pEp 2 r 

N = g{E) dE=—je [Ep - Tfei(r, p)] dV d^p, (2.3) 

where 0(a:) is the Heavyside step function. The momentum integration can 
be done immediately with (2.2), using its spherical symmetry in momentum 
space, and yields 
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Omitting the spatial part of the integral in (2.4) yields the TF approxima- 
tion of the spatial density: 

PTF(r) = ^ j [Af - h^(r)] 0 [Af - h^(r)] . (2.5) 

Note the step function which implies that the TF density is zero outside the 
classical turning points defined by Ap = C(r^). In the same way we can 
also obtain the total kinetic energy Akin of the system by the integral 

Akin = y 2^ ® “ Aci(r,p)] d^rd^p (2.6) 

which after momentum integration yields 

Akin = ^ (I?) V 
SO that the kinetic energy density becomes 

TTF(r) = ^ [Ap - y(r)] 0 [Ap - C(r)] . (2.8) 

From equations (2.5, 2.8) we can eliminate the Fermi energy and the poten- 
tial to obtain the TF functional for the kinetic energy density 

ttf[p] = ^ , with K = ^ (Stt^)^/^. (2.9) 

Equations (2.5) and (2.8, 2.9) are the expressions which were used by 
Thomas and Fermi [5] to describe the electronic density and energy of the 
atom semiclassically in a selfconsistent way (see, e.g., the book by March [6] 
for a very nice presentation). Although the TF model gives a nice qualita- 
tive description of the average properties of an atom, it is not very good. 
(It has been shown, though, that the total TF energy for the neutral atom 
becomes exact in the mathematical limit of infinite nuclear charge Z —>■ oo; 
see [7] for a review). It is possible, however, to improve the TF model by 
taking into account a series of quantum corrections, leading to the so-called 
“extended Thomas-Fermi” (ETF) model which we will summarize in the 
following subsections. 

2.2 Wigner-Kirkwood expansion 

There are various ways to derive the quantum corrections to the TF model; 
we use the method developed by Wigner and Kirkwood [8] which starts 
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from expansion in powers of h of the single-particle Bloch density matrix 
around its classical TF value. We refer to a recent monograph [4] for a 
detailed presentation of this method and summarize just the general idea 
and results. The basic idea is to relate the quantities of interest to the 
Bloch density matrix which is defined as the statistical operator in 

coordinate-space representation: 

C{r,r'] P) =Y^ip*{r') , (2.10) 

i 

where the sum goes again over the complete spectrum. Here /3 is a complex 
mathematical variable. The inverse Laplace transform (see, e.g. [9,10]) of 
C'(r,r';/3) yields the single-particle density matrix 

= ^eI 4c(r,r';/3) 

ei<Ep LP 

1 pc+ioo 1 

= ^ -C(r,r';/3)e^^"d/3 (c> 0). (2.11) 

From this one obtains easily the local density 

P(r) = l‘P*(r)P = P(r,i') (2-12) 

€i<E^ 

and the kinetic energy density 

'r(r) = -^ Y ‘PtW A(^i(r) = -^ Arp(r,r')|r'=r, (2.13) 

where the subscript r here and in the following indicates that an operator 
acts only on the variable r on its right. Note that these are so far the exact 
quantum- mechanical expressions. The TF approximation and its extensions 
may now be obtained in the following way. Take the matrix element in (2.10) 
in a plane- wave representation (including the spin factor 2): 

C(r, r'; P) = ^ J e-*P '’'A e"^^- e^P'^A. (2.14) 

Due to the potential V^(r) in (1.1), the plane waves are no eigenfunctions 
oi H . If we replace Hr in the exponent above by the classical Hamilton 
function Hc\{r,p) and perform the p integration, we obtain the classical 
Bloch density matrix 

/ \3/2 r 1 
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Inserting this into equation (2.11) yields the TF densities. To obtain quan- 
tum corrections to the TF results, one starts from the ansatz 

e-^^e*P'’/^ = e-^"‘=‘('’’P)e*P'’/''u>(r,p;/3), (2.16) 

and expands w{r, p; (3) into a series in powers of h: 



w = 1 -I- hwi + fi^W2 + ■ ■ ■ (2-17) 

Hereby the are functions of r, p, and P, containing up to n gradients 
acting on the potential C(r). The series (2.17) may, e.g., be obtained by 
expanding the left side of equation (2.16) in terms of repeated commuta- 
tors of the kinetic energy operator with H(r) (see Ref. [1]). Another way, 
solving iteratively a differential equation for w(r,p; P) (the so-called Bloch 
equation), is described in [4]. Only the even terms of the series (2.17) with 
n = 2m, m = 1,2,3,... survive the p integration for the local density and 
kinetic energy density, and give the ETF corrections to (2.5) and (2.8). 
These correction terms diverge at the classical turning points, but they can 
be integrated to yield finite corrections to the particle number, the total 
energies and other integrated quantities. The densities obtained in this way 
by truncation of the series (2.17) at some order are usually called the ETF 
densities pETp(r) and TETF(r) (see, e.g. [4] for their explicit forms). 



2.3 Gradient expansion of density functionals 

Eliminating the potential and its gradients from the ETF densities, which 
takes some algebra (c/. [11] where this has been done systematically also 
for momentum and spin dependent potentials), one arrives at the kinetic 
energy-density functional tetf[p] including gradient correction terms. We 
quote here only its integrated form as it contributes to the total kinetic en- 
ergy functional Tetf [p] ■ Keeping terms up to fourth order in % one obtains 
after some partial integrations [12, 13] 



T^kin 

-k 



= TetfM = ^ 



6480 



(37r")-2/3pi/3 




1 (Vp)^ 

36 p 




P 



24 



(2.18) 




In the first term of (2.18) we recognize the TF functional (2.9). The second 
term, coming from the corrections, has been derived for the first time by 
von Weizsacker [14], albeit with a different coefficient. Equation (2.18) has 
the correct coefficient in the limit of slowly varying densities, whereas the 
original Weizsacker coefficient (1/4 instead of 1/36) is correct in the limit 
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of rapid density oscillations with small amplitude (see [15] for a detailed 
discussion). The terms shown on the second line in (2.18) come from the 
terms of the semiclassical expansion (2.17). Although the derivation of the 
functional (2.18) sketched above is valid only within the classically allowed 
region, it can be shown from its extension to finite temperature [16,17], 
where the ETF densities stay finite and are defined in the whole space, that 
the zero-temperature limit leads to equation (2.18) also in the classically 
forbidden region (see also [18, 19]). 

The functional (2.18) is only expected to be correct for average densities 
without quantum shell oscillations, since it has been derived in a semiclas- 
sical framework that has no room for shell effects. Nevertheless, it has been 
used in connection with atomic HF densities [20] and found to reproduce the 
total kinetic energies quite well. A careful study of (2.18) using numerically 
Strutinsky-averaged quantum-mechanical densities p(r) for various forms 
of the potential F(r) was done in [21]. The Strutinsky-averaged kinetic 
energies T were found to be well reproduced when all terms up to fourth 
order in (2.18) were taken into account, as is demonstrated in Figure 1. 
The dashed curve in the right panel of the figure shows the exact quantum- 
mechanical kinetic energy which exhibits pronounced shell effects; the FTF 
functional (2.18) clearly serves to reproduce only the average part of the 
kinetic energy. 

Similarly, one obtains also gradient corrections to the exchange-correla- 
tion energy functional occurring in DFT [1], or corrections arising in con- 
nection with nonlocal potentials {e.g., spin-orbit and effective mass terms; 
see [19] for the case of effective Skyrme interactions in nuclear physics). 
Note that nonlocal potentials give also corrections to the kinetic energy 
functional (2.18) [11,19]. Relativistic functionals Tetf[p] have also been 
derived [1,22,23]. 

2.4 Density variational method 

Within the framework of the density functional theory (DFT), the 
Hohenberg-Kohn theorem [24] tells us that the total energy of a fermion 
system is uniquely given as a functional of the local density p(r) . We write 
it here as the sum of kinetic and potential parts: 

Etot = Ts[p] + Epot[p] = j {r(r) -|- £pot[p]} d^r, (2-19) 

where Ts[p] is the kinetic energy of a non-interacting system with den- 
sity p(r) and £’pot[p] the sum of all potential energy terms including the 
contribution from any external one-body potential 14xt(r)) which just gives 
a term 14xt(r)p(r) under the integral above, and the exchange-correlation 
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Fig. 1. Tests of the kinetic energy functional tbtf[p] (2.18). Left: deformed 
Woods-Sa:xon potential with N = 126 nucleons (after [13]; see the definition there 
of the deformation parameter c). Right: axially symmetric harmonic-oscillator 
potential with frequency ratio q with N = 112 particles (after [21]). The index 
in T 2 m['p\ gives the maximum order of Ti corrections included (m = 0 is TF). T is 
the Strutinsky-averaged kinetic energy (not distinguishable from T 4 [p] on the r.h. 
side). Note the shell effects in the exact kinetic energy T shown by a dashed line. 



energy (that also takes into account the interacting part of the kinetic en- 
ergy, see [1]). The exact functional Ts[p] is not explicitly known; therefore 
one has to express it via r(r) in equation (2.13) in terms of single-particle 
wavefunctions </^i(r) and vary the total energy with respect to these, which 
leads to the Kohn-Sham (KS) equations. 

If we are not interested in the quantum shell effects, we can replace 
Ts[p] by the semiclassical ETF functional (2.18), so that the energy (2.19) 
becomes an explicit functional of p(v). We can then directly apply the varia- 
tional principle by making E[p] stationary with respect to an arbitrary local 
variation Sp{r) of the (average) density. To keep the number of particles N 
constant we include a Lagrange multiplier p, which is just the Fermi energy 
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of the average system. This yields the Euler-Lagrange variational equation 



6 

Sp{r) 



E[p{r)] - M 




= 0 . 



( 2 . 20 ) 



Its solution gives the average ground-state energy and density of the sys- 
tem within the ETF approximation. To keep the notation simple, we still 
write p(r) but keep in mind that p{v) and E[p] only represent the average 
quantities. The variation of the potential energy gives, quite generally, the 
local mean-field potential V (r) which depends in a selfconsistent way on the 
density itself 

^Epot[p(r)]=E[p(r)]. (2.21) 

V[p] is formally identical to the total KS potential (except that it is evalu- 
ated here for the average density); it depends, of course, on the particular 
system under consideration and on the two-body interaction. The variation 
of the kinetic energy part, however, is universal and can be calculated di- 
rectly from the explicit form of the ETF functional given above. To avoid 
lengthy formulae, we give here explicitly only the result obtained up to the 
Weizsacker term. Assuming the density variation Sp{r) to vanish on the 
boundary (which usually is at infinity) and using the standard techniques 
of variational calculus, one gets the following differential equation 



2m 




1 

36 



{WpY 




+ D4[p] + V[p] — p , , 



( 2 . 22 ) 



where D 4 is the contribution from the fourth-order gradient term, D 4 [p] = 
5T4[p\I 5p, whose explicit form (containing seven terms with up to fourth- 
order gradients of p) may be found in [19]. Equation (2.22) is a partial 
differential equation which in general can only be solved numerically. To 
lowest TF order, neglecting all gradient terms, the solution of (2.22) for 
the density p(r) is algebraic and gives, as it should, exactly the TF den- 
sity pTF(r) given in (2.5). 

If the gradient terms are kept in equation (2.22), the nature of the so- 
lutions depends on the behaviour of the potential E(r). For the following 
discussions and applications, we shall assume that V (r) is finite in the inte- 
rior of the system and goes to zero at large distances. Then the asymptotic 
fall-off of the density is always governed by the highest gradient term in- 
cluded in the variational equation (2.22). (See [1] for the problems that 
arise in atomic systems from the 1/r divergence at the center due to the 
external nuclear potential). Keeping just the Weizsacker term, the solutions 
for the density are found to decay exponentially. This is because the con- 
stant Fermi energy p can only be cancelled by the highest gradient term in 
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equation (2.22). The rate of the exponential fall-off turns, however, out to 
be too fast in comparison to that of exact quantum- mechanical densities. 
One pragmatic way out of this problem is to artificially increase the coef- 
ficient of the Weizsacker term, adjusting it by optimizing the results. The 
dilemma hereby is that the same constant cannot be used for densities and 
for integrated energies. Nevertheless, this procedure has been widely used 
in physics and physical chemistry (see [1] for details and applications). It 
violates, however, the aspect of universality which is one of the nice features 
of the ETF model, and can easily be avoided by including the higher-order 
gradient terms in equation (2.22). 

Including the fourth-order contribution [p ] , the density of a spherical 
system falls off only with the sixth inverse power of r, i.e., p{r) ~ r“®. More 
generally, the highest derivative term in T 2 m[p] goes radially like [21] 

r.Mr)\ ~ b(--)l (2'23) 

and the solution of equation (2.20) is found to fall off like 

p{r) ~ ^ qq (to > 2). (2.24) 

Inserting this into (2.23), one finds that the selfconsistent kinetic energy 
density r(r) goes to zero with the same power law as p{r) in (2.24). It 
can therefore always be integrated to give a finite kinetic energy. (The 
fact that the sixth- and higher-order gradient corrections to tetf[p] yield 
diverging integrals for densities that decay exponentially like p(r) ~ e~“’’ 
has repeatedly been noted in the literature and interpreted as a break down 
of the ETF gradient expansion. This is an erroneous conclusion, since the 
selfconsistent solutions of (2.22) for to > 2 do not fall off exponentially.) 

The inverse power law (2.24) found for to > 2 is, of course, not realistic 
for most systems, as the quantum-mechanical density usually falls of expo- 
nentially. It has turned out, however, that the asymptotic solution (2.24) 
is reached only mathematically very far away from the actual surface of a 
realistic system, and that the inclusion of the fourth-order term definitely 
improves the variational results. This has been shown in numerical solu- 
tions of the ETF variational equation (2.22) up to fourth order, both for 
atomic nuclei [18,25] and for metallic clusters within the jellium model [26]. 
Spherical symmetry was assumed in both cases so that the differential equa- 
tion was only one-dimensional. The applications to metal clusters will be 
discussed explicitly in the remainder of this section. 

One basic problem of the ETF variational method is that there is ef- 
fectively no lower bound to the total energy, different from the Hartree- 
Fock method where this is granted by the Ritz principle. A priori, the 
Hohenberg-Kohn theorem [24] tells us that the total energy (2.19) has its 
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variational minimum at the value of the exact energy of the system. But 
this holds only true if the exact density functional E[p] is used which, how- 
ever, is unknown. Due to the approximations necessary to put the density 
functional method into use - and here in particular due to the ETF ap- 
proximation - this lower bound is no longer given. As a consequence, the 
variational ETF results are typically overbound, z.e., the (absolute) bind- 
ing energies are too large [19,25]. The amount of overbinding in nuclei is, 
however, much smaller than one percent of the total energy. 

We should like to emphasize again that due to the very nature of the 
ETF model, the Euler equation (2.22) cannot reproduce any of the quantum 
shell oscillations typical of finite fermion systems. The ETF variational 
method is thus limited to yield average properties of finite fermion systems. 
A perturbative way to include the shell effects, without going through the 
fully self-consistent microscopic HE or KS scheme, makes use of Strutinsky’s 
shell-correction method (see Ref. [4]). 

The extension of the ETF model to systems at finite temperature that 
can be described by a grand-canonical ensemble is straightforward, though 
algebraically quite tedious. We refer to the literature [4, 17, 18] for its pre- 
sentation and, in particular, to [18] where the functional Tetf[p] with its 
explicitly temperature-dependent coefficients can be found, as well as ap- 
plications to nuclear physics. An example of the results obtained there is 
given in Figure 5 below. 



2.5 Applications to metal clusters 

2.5.1 Restricted spherical density variation 

The numerical solution of the nonlinear fourth-order differential 
equation (2.22) becomes very difficult, if not practically impossible, for de- 
formed systems without symmetry. For that case one may perform the 
variation in (2.20) not exactly in r space, but in a restricted space of pa- 
rameterized variational densities. The choice of the variational density is 
then a matter of physical intuition and, of course, not free of a certain bias. 
A spherical density profile that has proved useful for both nuclei [19] and 
metal clusters [27, 28] is a generalized Fermi function: 



p{r) 



Po 

[l + exp(^)]^' 



(2.25) 



Of the four parameters above, only three are independent due to the con- 
straint of the particle number conservation. The independent parameters 
are then determined variationally by minimizing the total energy of the sys- 
tem under consideration, and no adjustable parameter is used anywhere. 
With this restricted variational procedure up to fourth order in the ETF 
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Fig. 2. Comparison of selfconsistent densities and potentials for the spheri- 
cal Naigs cluster obtained in the jellium model. V (r) is in Rydberg units, p(r) 
in units of the ionic bulk density pg- Solid lines: quantum-mechanical KS re- 
sults [29]. Dashed lines: semiclassical ETF results including up to 4th-order 
gradient terms, and dashed-dotted lines: only up to 2nd-order gradient terms in 
the functional (2.18). (From [27]). 



functionals, density profiles and energies were obtained that agree very 
closely with the fully variational ETF results [25, 26] on one hand (see the 
following subsection), and with the averaged results of fully microscopic KS 
calculations for metal clusters [29,30] and HE calculations for nuclei [19,25] 
on the other hand. In particular, the profiles of the variationally optimized 
densities (2.25) match the selfconsistent quantum-mechanical ones over the 
entire region of the surface, demonstrating that the mathematically inade- 
quate asymptotic inverse-power decays (2.24) of the exact solutions of (2.22) 
have no practical significance. 

As an example we show in Figure 2 the densities and potentials obtained 
in a large sodium cluster within the spherical jellium model (see, e.g. [31] 
for a detailed discussion of this model), both by solving the fully quantum- 
mechanical KS equations [29] and by the ETF density variational method 
using the profile (2.25) [27]. When including the 4th-order gradient terms 
in (2.18), the ETF solution agrees perfectly with the KS solution in the 
outer surface region, whereas the 2nd-order solution falls off too rapidly (as 
mentioned above). Note the shell oscillations in the quantum results. 
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Fig. 3. Upper part (a): energies per atom E/N in units of the bulk value en; 
lower part (b): ionization potentials IP (in eV) for spherical jellium clusters with 
Cs = 4 a.u., shown versus cluster radius R\. Wh is the bulk work function. Part (c) 
gives the electrostatic parts of the ionization potentials. Dots and small circles 
are the quantum- mechanical KS results [29] ; the solid lines are the 4th-order ETF 
results [27]. (After [27]). 



Figure 3 shows a similar comparison for (a) the total energy per particle 
E/N and (b) the ionization potential IP for neutral spherical sodium clus- 
ters with N atoms obtained in the jellium model, plotted versus the cluster 
radius i?i = (with a Wigner-Seitz radius of Ts = 4 a.u.). Again 

we see that the ETF results are smooth curves that miss the periodic shell 
effects but represents the correct average behaviour. The asymptotic limit 
of the IP for large clusters is the bulk work function Wb- Note that the 
electrostatic part of the IP shown at the bottom (c) oscillates much less 
than the other energies. The fact that the semiclassical total energy seems 
to lie lower than the average of the quantum results is due to the imposed 
spherical symmetry. Indeed, the minima of the quantum results correspond 
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Fig. 4. Ionization potentials IP and electron affinities EA of aluminum clusters 
with N atoms, plotted versus the inverse cluster radius. Black diamonds: exper- 
imental results, solid lines: calculated ETF values in the spherical jellium model 
with Ts = 2.99. (From [32]). 



the spherically “magic” clusters that have filled shells; in the regions in 
between, the clusters have only partially filled spherical shells and want 
to be deformed due to the Jahn-Teller effect. If a spontaneous symmetry 
breaking of the mean field is allowed in the KS calculation, their energies 
are systematically lowered, so that the ETF results come to lie much closer 
to the averaged KS results (see [31] and Sect. 3.5.2 for a discussion of the 
deformation effects). 

A comparison of experimental ionisation potentials IP and electron af- 
finities EA (see their definitions in Eq. (2.27) below) of aluminum clus- 
ters with the variational ETF results [32] , obtained in the spherical jellium 
model, is shown in Figure 4 versus the inverse average cluster radius (which 
is proportional to The experimental results exhibit, of course, os- 

cillations that come both from electronic shell effects and from the ionic 
structure that cannot be described by the jellium model. It is quite amaz- 
ing that a simple semiclassical ETF calculation can capture the average 
experimental trends even down to the smallest molecules with N = 3, cor- 
responding to the rightmost experimental point. (Note, however, that the 
calculated values have been vertically adjusted to fit the experimental bulk 
work function Wh at the intersection with the vertical axis, since the jellium 
model yields a wrong value of Wh- In this way, the average effects of the 
ionic structure has been taken into account implicitly (see [32] for details) . 
See also [33] for improved results in the “stabilized” jellium model). 
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Z(fm) 



Fig. 5. Density profiles p{z) for an interface of nuclear matter and a gas of 
nucleons in thermal equilibrium, calculated at the temperatures T = 0, 5, and 
10 MeV. it Solid lines: exact solutions of the Euler-Lagrange equation (2.22) up 
to fourth order; dashed lines: results using a restricted parametrization of p{z) 
corresponding to equation (2.25). (From [18]). 



2.5.2 Unrestricted spherical density variation 

A test of the parametrization (2.25) of the spherical density profiles has 
been provided by unrestricted density variational calculations of Engel and 
Perdew [26] in the framework of the spherical jellium model. These au- 
thors solved numerically the Euler-Lagrange equation (2.22) including the 
fourth-order gradient terms. Their density profiles p{r), and quantities de- 
rived from them, are extremely well reproduced by the results of the re- 
stricted variation using the profiles (2.25). A similar test was done in nu- 
clear physics [18] using the finite-temperature version of the ETF model. In 
Figure 5 we show the density profiles p{z) across the plane interface (po- 
sitioned at z = 0) of nuclear matter in thermal equilibrium with a gas of 
nucleons, calculated for three different temperatures. The solid lines give the 
results obtained from the exact numerical solutions of the Euler-Lagrange 
variational equations (in one-dimensional planar geometry); the dashed lines 
are obtained using a restricted variation with the one-dimensional analogue 
of the parametrization (2.25). 

When the ionic structure is included in the theoretical description of 
metal clusters, the electronic density profiles can no longer be given by 
simple parametrizations, and unrestricted density variations become a ne- 
cessity. This has been done by various groups [34-37] who performed vari- 
ational (E)TF calculations for the electrons, in connection with molecular 
dynamics for the ions in the Born-Oppenheimer approach. As an example 
of the results of such calculations, we show in Figure 6 the density pro- 
file of the highly deformed charged cluster Na^'g’^, which has been called a 
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z(ao) 

Fig. 6. Ground-state electron density p{z) of the fission isomer of Na]*'g^, computed 
quantum-mechanically in Kohn-Sham DFT (heavy line) and semiclassically in TF 
theory (thin line). (From [35]). 



“fission isomer” [35]. It has an approximate axial symmetry, and the den- 
sity is shown here along the direction of the symmetry axis. The heavy line 
was obtained from solving the quantum-mechanical KS equations, whereas 
the thin line is obtained in the simple TF approximation. 



2.5.3 Liquid drop model for charged spherical metal clusters 

We finally mention a useful application of ETF variational calculations to 
derive liquid-drop model (LDM) expansions of the total energy or other 
observables of a fermion system. When its density is of the form seen in 
Figures 5 or 2 with a constant saturated value po in the interior and a well- 
defined surface region a around the radius R where it falls off most steeply, 
one can perform analytically a so-called “leptodermous” expansion [19, 38] 
of all integrated quantities in powers of the small parameter a/R. For 
metal clusters this has been done systematically by Seidl [28, 39] within the 
spherical jellium model. The total energy of a cluster with N atoms and Z 
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valence electrons is found to be: 

E{N,Z) = CbN + asN^^^ + acN^/^ + ao-Wb{Z - N) 

+ |a_i + ^ C{Z - N) + ^{Z - Nf 

+0 . (2.26) 
For the ionization potential IP and the electron affinity EA, defined by 
IP = E{N,N-1)-E{N,N), EA = E{N,N)~ E{N,N +1), (2.27) 
one finds the following expansions up to order 1/i? = 

IP(i?) = TPb + Q - ^ + O (i?-2) , 

EA(i?) = TPb- Q + ^ + 0(i?-2). (2.28) 

These expressions have been found to fit rather well the average experi- 
mental data for large clusters; the terms of order 0{R~^) become, however, 
particularly important for the electron affinities of small clusters [40] (see 
also Fig. 4 above). 

All the parameters appearing in equations (2.26, 2.28) can be related to 
bulk properties (energy per particle Cb, Wigner-Seitz radius Ts) or properties 
of the infinite planar metal surface such as the work function Wh, the surface 
tension ct in Og = 47rr^a, and other integrals [19,38] over the infinite surface 

density profile Poo{x) (x being the distance from the plane with a; = 0 at 

the jellium edge). In particular, the parameter C is given by 

2 a r 1 

C=-^ 47t / da; a;^poo(a;) + a;/(a;) , (2.29) 

3 e^/xs Jo L J 

where the function f{x) appears in the asymptotic expansion of the density 
profile pu{r — i?) of a large spherical cluster with radius R [26,33,41]: 

Pfl(x) = Poo(x) + ^ + 0(i?-2). (2.30) 

Note that C is not equal to 1/8, as was derived from a wrong classical 
image-potential argument which unfortunately still floats around in the lit- 
erature and in the heads of some researchers. (See [42], where the flaw in 
this argument has been uncovered and corrected). In fact, the quantity C, 
which in (2.28) above gives small deviations from the (correctly calculated) 
classical charging energy e^/2i? of a sphere, is of purely quantum-mechanical 
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origin with contributions from the kinetic and exchange-correlation energies 
of the electrons and the non-classical tail of their density. 

The LDM expansion has also been discussed in the framework of the 
so-called “stabilized jellium model” [33] which yields improved bulk work 
functions and surface energies, and therefore provides a better starting point 
than the simple jellium model. 



3 Periodic orbit theory for quantum shell effects 

In this section, we shall describe the semiclassical approach to quantum 
shell effects. Historically, the “old” quantum theory by Bohr [43] started 
from a semiclassical approximation to the energy spectrum of the atom, 
requiring the action integral of the electron on its supposedly classical or- 
bit around the nucleus to be an integer multiple of Planck’s constant h 
(see [44] for a remarkable historical account). This semiclassical quantiza- 
tion scheme was later developed into the WKB method (named after Wenzel 
et al. [45]) for one-dimensional systems and, more than 30 years later, into 
the EBK quantization or “torus quantization” scheme for more-dimensional 
integrable systems. This method is named after Einstein et al. [46], who 
revived the idea of semiclassical quantization using the very fundamental 
observation by Einstein [47] that the classical motion of an integrable sys- 
tem is bound to a torus in phase space. (See, e.g. [4] for a presentation of 
the basic ideas of semiclassical quantization). 

The EBK method does not apply if a system with N degrees of freedom 
is not classically integrable, i.e., if it does not possess the same number N of 
constants of the motion which are independent and which Poisson-commute 
pairwise. The question if a non-integrable system can be quantized semiclas- 
sically remained therefore unanswered for some time. A break-through was 
accomplished by Gutzwiller [48], who derived a “trace formula” that con- 
nects the oscillating part of the quantum-mechanical level density to a sum 
over all periodic orbits of a classically non-integrable system. Independently, 
similar trace formulae were derived later by Balian and Bloch for particles 
in a cavity with reflecting walls {i.e., for three-dimensional billiards) [49], 
and by Berry and Tabor for integrable systems. Starting from EBK quan- 
tization, these authors first showed that the sum over the EBK spectrum 
can be converted quite generally in a sum over periodic orbits [50], and 
later derived the same result starting from Gutzwiller’s semiclassical Green 
function [51] (see Sect. 3.1 below). Since the classical periodic orbits play a 
fundamental role in this semiclassical approach, it has been called “periodic 
orbit theory” (POT). Including a sufficient number of orbits in the trace 
formula, it can in principle yield the quantized energies of the system in the 
form of singularities (the peaks of the sum of delta functions). This works. 
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however, only in a limited number of cases as the trace formulae represent 
asymptotic series which usually do not converge [52]. 

The applications of the POT which we want to discuss here do not aim at 
the quantization, but at a semiclassical description of gross-shell features in 
mean-field systems, which can be obtained by convoluting the level density 
with a smooth function over a finite energy range 7. Then, the periodic 
orbit sums converge [53] and only the orbits with the shortest periods and 
largest degeneracies become important. 

We shall not derive here the trace formulae but refer to the original liter- 
ature. For the approach of Gutzwiller, the reader may consult a monograph 
by the same author [52] , and the very readable series of lectures which he 
presented a an earlier Les Houches Summer School [54] . The original trace 
formula of Gutzwiller [48] applies only to systems in which all orbits are iso- 
lated; this is the case for a classically chaotic system. A series of extensions 
to Gutzwiller’s theory to take into account degenerate orbits, which appear 
in systems that possess continuous symmetries, have been developed over 
the past 30 years [55,56]. Their basic ideas are discussed in [4]. 

In Section 3.1 we shall present the semiclassical Green function devel- 
oped by Gutzwiller, which is the basic ingredient to all further developments 
in the POT. In Section 3.2 we present the trace formulae for the level density, 
particle number and total energy of a mean-field system, and in Section 3.3 
we briefly discuss practical methods for finding periodic orbits and calcu- 
lating their stabilities. Section 3.4 contains a summary of the problems 
connected with symmetry breaking transitions and orbit bifurcations, and 
indicates that these can be overcome by uniform approximations. The re- 
maining parts are reserved for applications to metal clusters and mesoscopic 
semiconductor devices. 



3. 1 Semiclassical expansion of the Green function 

The quantum-mechanical level density g{E) given in equation (1.3) can be 
related to the imaginary part of the trace of the energy-dependent single- 
particle Green function G(r,r'; E) for a system in D dimensions: 

g{E) = -^^ J G{r,r;E + ie)d^r (e > 0). (3.1) 

The Green function itself is defined by 



1 



G(r,r'; A) = ^ <p*(r') </Ji(r) 



{E - Ei) 



(3.2) 
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and can be related to the single-particle propagator K{r,r';t) by a (half- 
sided) Fourier transform: 



G{r,r']E) = -- limj AT(r, r'; t) exp 



-(A -I- ie)t 



dt. 



(3.3) 



Semiclassical approximations to the propagator go back to Van Vleck in 
1928 [57]. Gutzwiller rederived Van Vleck’s approximation from Feynman’s 
path integral approach to quantum mechanics and improved it by the incor- 
poration of some important phases. The basic idea of his derivation is the 
stationary phase approximation, applied to integrals containing a rapidly 
oscillating phase. Doing also the Fourier transform in (3.3) by the station- 
ary phase method, he arrived at the following semiclassical approximation 
to the Green function [48]: 



Gsci(r, r'; E) ~ ^ ^/fD^\ exp 

{2inn) 2 ^ 



t 7T 

-Sa{r,r',E) - ifia- 



(3.4) 



The sum goes over all classical trajectories a that lead from the point r' 
to the point r at fixed energy E] the functions Sa{r,r',E) are the action 
integrals along these trajectories: 

S^{r,r';E)= [ p"- dr" = ^ T p,(r", A) dr", (3.5) 

z=l “'’■1 



where rt, pi are the components of phase-space vectors r and p, respectively. 
In equation (3.4), T>a is the determinant of the (D-l-l)-dimensional matrix 
of second partial derivatives of the action integral with respect to all its 
arguments 





d^So. 


dr' dr 


dr'dE 


d^Sa 


d^S„ 


dEdr 


dE^ 



(3.6) 



The quantities pa in (3-4) count the number of caustic (or “conjugate”) 
points along the trajectory a; they are connected to the Maslov indices 
appearing in the EBK quantization of integrable systems [46]. 



3.2 Trace formulae for level density and total energy 

It remains only to insert the semiclassical Green function (3.4) into the re- 
lation (3.1) and to perform the trace integral. Taking r' = r under the 
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integral automatically selects only closed orbits from the sum over trajecto- 
ries a. If these closed orbits are isolated in phase space, then all spatial inte- 
grals transverse to the orbits can again be done using the stationary phase 
approximation; only the integration along the orbit is done exactly. The 
stationary condition for the exponent in (3.4) leads, using equation (3.5), to 

dr 

where ro is the point where S becomes stationary. Equation (3.7) implies 
that the orbits are not only closed but periodic, i.e., they have to start and 
end with the same momentum: p' = p. The transformation to a coordinate 
system co-moving with each periodic orbit, the evaluation of the saddle- 
point integrations, and expressing the result in a canonically invariant form 
- all this requires quite some tedious algebra and finally leads to Gutzwiller’s 
famous trace formula for isolated orbits [48]: 






dr 



dr' 



= p - p' = 0, 



(3.7) 



J r' =r=rn 



5gsc\{E) 




po 



T, 



ppo 



: COS 



jdet (Mpo - 1)1 




(3.8) 



The sum here goes over all periodic orbits po, including their repeated cy- 
cles. Mpo is the stability matrix discussed in Section 3.3 below and 1 the 
unit matrix with the same dimension, Spo is the action integral along the 
(repeated) periodic orbit 

Spo{E) = I p • dr, (3.9) 

J po 

Tppo = dSppo/dE is the period of the primitive periodic orbit {i.e., its first 
cycle), and apo is the so-called Maslov index which arises as a sum of the 
phase index pLpo occurring in the semiclassical Green function (3.4) plus an 
index Vpa (being 1 or 0 for a two-dimensional system) that stems from the 
stationary-phase evaluation of the trace integral. (See also [58], where it 
has been shown that the total Maslov index apo = fJLpo + Vpo is a canonical 
and topological invariant, whereas pLpo and Vpo alone may depend on the 
starting point along the orbit or on the choice of the coordinate system). 
The index Upo may also be obtained as a winding number [58,59]. All 
quantities appearing on the r.h.s. of the trace formula (3.8) depend smoothly 
on the energy E. (The Maslov indices are integer constants in finite energy 
intervals but usually change abruptly by one or more units at bifurcations; 
see Sect. 3.4 below). 

For systems that contain continuous symmetries, most (or all) periodic 
orbits are not isolated in phase space but appear in degenerate families. 
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The single orbits within these families can be smoothly transformed into 
each other by a symmetry operation (translation, rotation, etc.) that does 
not affect their properties (actions, stabilities, Maslov indices). Conse- 
quently, the stationary-phase approximation to some of the trace integra- 
tions perpendicular to the orbits breaks down (namely the directions in 
which the orbits can be displaced by these symmetry operations and in 
which the actions remain constant). As a result, the determinant in the 
denominator of (3.8) becomes zero and the trace formula diverges. Similar 
problems arise also in isolated situations {e.g., at specific energies or values 
of some other continuous system parameter) in connection with orbit bi- 
furcations and symmetry-breaking (or symmetry-restoring) transitions. For 
these latter situations special uniform approximations can be developed, 
which will be briefly discussed in Section 3.4. 

The problem of continuous orbit degeneracies can be solved by per- 
forming exactly the trace integrations in those directions where the action 
remains constant [55,56]. (See [4] for a detailed discussion of different tech- 
niques used hereby) . The results of these extensions of the Gutzwiller theory 
are trace formulae of the general form, which also applies to those derived 
earlier for billiards [49] and integrable systems [50,51]: 



5gsc\{E) ~ '^Apo{E) cos 

po 



^Spo{E^ CTpo 



7T 

2 



(3.10) 



Here the sum goes over all distinct periodic orbit families (plus isolated 
orbits if they exist) . The overall Maslov indices apo may also become half- 
integer. The precise form of the amplitudes Apo depends on the particular 
symmetry of the system. The inverse power of h contained in these am- 
plitudes equals H-//2 where / is the degree of degeneracy of a given orbit 
family, i.e., the number of independent continuous parameters describing its 
degeneracy (or, equivalently, the number of different symmetry operations 
possible, or the dimension of the group creating these symmetry opera- 
tions) . Note that families with different degrees of degeneracy and isolated 
orbits with / = 0 may coexist in one and the same system. For example, 
in a three-dimensional system with spherical symmetry, the most degener- 
ate orbit families occur with a three-fold continuous degeneracy (/ = 3), 
because their possible orientations are described by the three Euler angles; 
the group creating the corresponding rotations is the SO(3) group; the am- 
plitudes A of these orbits are proportional to . In a spherical billiard, 
there is also the linear orbit bouncing along a diameter; its degeneracy 
is only / = 2 since rotation about its own direction does not create a new 
orbit. If the potential is a smooth and attractive central field V (r), there ex- 
ists also an isolated orbit with / = 0, namely the circular orbit with a radius 
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corresponding to the minimum of the effective potential V{r) + L^/2mr^ 
for a given non-zero angular momentum L. 

When applying the POT to a fermionic mean-field system, one is also 
interested in integrated quantities obtained from the level density. Its first 
integral gives the number of particles, see equation (2.3). Performing the en- 
ergy integration in the saddle-point approximation yields the trace formula 
for the oscillating part of the function N{Ep): 



SNsc\{Ef) ~ ^ 
po 




ApoiEp) sin 



^Spoi^Ep^ CTpo 



7T 

2 ’ 



(3.11) 



where Tpo is the period of the (repeated) periodic orbit (family) at the 
Fermi energy Ef- The oscillating part 6E of the total binding energy of an 
interacting system in the mean-field approximation can be shown [2, 55] to 
be given, to leading order in the oscillating part of the level density Sg, by 



rEp 

SEsci / {E-Ep)Sg{E)dE 
Jo 



- E (^) 



cos 



^ Spo{Ef^ CTpo 2 



(3.12) 



(See also [4] for a discussion of the “Strutinsky theorem” from which the 
first equation above can be derived). Note that in the above two trace 
formulae, only the orbit properties at the Fermi energy Ef are required. 

In most realistic physical systems, the periodic orbits and their proper- 
ties can only be calculated numerically (see Sect. 3.3) - which, as already 
mentioned, can be a very difficult task. For systems which are only weakly 
perturbed away from an analytically known integrable limit, a perturbative 
trace formula has been developed by Creagh [60], for which the perturbed 
orbits need not be determined. Assume a Hamiltonian of the form 



H = Ho + eHi, (3.13) 

where Hq is integrable and e is a small dimensionless number. The idea 
now is that to lowest order in e, the amplitudes and Maslov indices in the 
trace formula will not be affected by the perturbation, and that only the 
lowest-order change SSpo induced by eH\ has to be retained in the phase. 
The perturbed trace formula is then of the form 

(A) cs 3? 4°) {E) Mpo{x)A^^'"^ 

I VO 



(3.14) 
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where the quantities with superscript are those appearing in the trace 
formula for the unperturbed system Hq, and A4po{x) is a - generally com- 
plex - modulation factor defined as the average of the phase change due to 
SSpo over the unperturbed orbit family: 

Mpo(x) = . (3.15) 

\ / po 



The argument x of the modulation factor is proportional to the lowest power 
of e for which the action change 6Spo is non-zero, inverse proportional to Ti, 
and further depends on the energy E. The perturbative trace formula for 
the total energy is correspondingly given by 




By construction, the unperturbed trace formulae are recovered in the 
limit e ^ 0. For large values of e, this approach will eventually break 
down and uniform approximations must be used to recover the correct am- 
plitudes of the trace formula applying to the strongly perturbed system (see 
Sect. 3.4). 

All the above trace formulae are remarkable in the sense that they relate 
the quantum-mechanical oscillations contained in their left-hand side with 
a sum over periodic orbits on their right-hand side that includes only prop- 
erties of the classical system. They represent asymptotic series which can 
be expected to become exact in the limit Spo ^ h. As already mentioned 
in the introduction to this section, these series usually do not converge, and 
in many systems the problem of enumerating all periodic orbits and calcu- 
lating their properties is practically not solvable. There exist a few soluble 
(integrable and non-integrable) systems for which the trace formula can be 
shown to converge to a sum of delta functions if the smooth part g{E) of 
the level density is added to it - and for a few of those, the result is even 
identical to the exact quantum-mechanical level density [4, 52] . But these 
are rare exceptions. 

For the applications discussed here, both sides of the trace formulae are 
convoluted with a normalized Gaussian function exp [— (A/7)^]/\/7r7. Using 
saddle-point integration for the convolution of the periodic orbit sum, one 
obtains an extra factor 



exp 



( 'LEue 

V2 n 



(3.17) 



under the summations in all trace formulae above. This factor suppresses 
the orbits with longer periods and usually leads to the convergence of the 
orbit sums (c/. Ref. [53]). In particular, it shows that the gross-shell effects 
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are governed by the shortest orbits. Exact quantization by summing over 
all orbits can still be attempted by taking the limit 7 — > 0. 

Concerning the smooth part g{E) of the level density that was the sub- 
ject of Section 2, it has been shown [49, 61] to arise within the periodic 
orbit theory from the closed orbits of zero length (and hence zero action) 
that mathematically also contribute to the sum of trajectories in the semi- 
classical Green function (3.4) when taking the trace integral in (3.1). Since 
their actions (and also their Maslov indices) are zero, they do not contribute 
in an oscillatory fashion to the level density but just constitute its phase- 
space average which yields the (extended) TF approximation as discussed in 
Section 2. This becomes particularly transparent in the derivation of the 
trace formula for integrable systems by Berry and Tabor [50] (see [4] for 
some exactly soluble models). 



3.3 Calculation of periodic orbits and their stability 

We briefly present here the linear stability analysis of periodic orbits and 
how the stability matrix can be used to And periodic orbits. We start from 
a holonomic Hamiltonian with a local potential V (r) in D dimensions: 

77(r,p) = ip2 + y(r). (3.18) 



(We put the mass equal to unity and assume the kinetic energy to be 
quadratic in p just for simplicity of the presentation here; any general 
dependence of p and r is allowed in principle). Hamilton’s equations of 
motion 



P = 



dH 
dr ’ 




(3.19) 



are in general non-linear. Assume now that we have found a periodic orbit, 
z.e., a T-periodic solution 



ro(t) = ro(t-kT), po(t) = po(t -k T), (3.20) 

and that E(r) in equation (3.18) is holomorphic around E(ro). (We shall 
turn further below to the question how to And such a periodic orbit practi- 
cally). We now want to investigate the stability of this solution with respect 
to some small perturbations <5r, <5p: 



r = ro -k i5r, p = pp ,5p. (3.21) 

Substituting (3.21) into (3.19) and expanding up to second orde with respect 
to the perturbations, we And the 2D linearized diflerential equations for i5r 
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and i5p, also called the Poincare variational equations: 



d / (5r 
di y (5p 



0 1 

-V"o(t) 0 




(3.22) 



where 0 and 1 are the H-dimensional zero and unit matrices, respectively, 
and V"o(t) is the matrix of second derivatives of the potential P(r), taken 
at ro(t): 



(V"o).,(t) 



d^V(r) 



dridrj 



r=ro(t) 



{i,j = 1,2,...,D). 



(3.23) 



Equation (3.22) is a set of linear differential equations with T-periodic co- 
efficients. From their general theory (see, e.g. [62]) one learns that the 
evolution of some initial perturbation <5r, (5p at time t = 0 up to the time t 
is given by 



( ) = X«) ( ) . 

V <5p(t) J \ Sp{0) ) 



(3.24) 



where X(t) is the so-called “matrizant” of dimension 2D. From 

equation (3.22) it is seen that the matrizant fulfills the differential equa- 
tion 




0 1 

-V"o(t) 0 



X(t) 



(3.25) 



with the initial conditions 



(X)„„(0) = (5mn (m,n = 1,2, . . . ,2L)). (3.26) 



The value of the matrix X(t) at the time of the period T is called the 
“monodromy matrix” M: 

M = X(T). (3.27) 

M is a symplectic {2D x 2D) matrix. According to the Lyapounov-Poincare 
theorem, its eigenvalues Xm (m = 1,2,...,2H) are pairwise inverse. Fur- 
thermore, two of the eigenvalues are always equal to unity, corresponding 
to small perturbations that either occur along the orbit or amount to a 
rescaling of the energy. Thus, M can always be brought into the form 



M = 






(3.28) 



where the 2D — 2 dimensional submatrix M is the stability matrix appearing 
in the denominator of Gutzwiller’s trace formula (3.8). The stability matrix 
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thus describes the propagation of a small perturbation (rj_,pj_), transverse 
(in phase space) to the given periodic orbit, over one period T: 



( U M ( ^ 

1 ip±m j V sp±(o ) ) 



(3.29) 



Its eigenvalues give us the information on the stability of the periodic orbit. 
Note that detM = 1, which expresses the fact that the Poincare mapping 
defined by equation (3.29) is area preserving. 

For two-dimensional systems {D = 2), the stability matrix is also two- 
dimensional and the stability analysis becomes particularly simple. Its two 
eigenvalues are just A and 1/A, so that 



trM = A-kl/A (D = 2). 



(3.30) 



For stable orbits, the eigenvalues lie on the complex unit circle, so that 
A = e^*"^ and a small perturbation just keeps oscillating around the peri- 
odic orbit. The real number k is called the stability angle. For unstable 
orbits, the eigenvalues are of the form A = ±e^^, where the real y > 1 is 
the Lyapounov exponent which measures the rate at which a small initial 
perturbation is growing exponentially in magnitude. Actually, one does not 
need to compute the eigenvalues of M to find out if an orbit is stable or 
not: since, as is easily seen from the above, |trM| > 2 for unstable and 
|trM| < 2 for stable orbits, it is enough to know the trace of M. When 
|trM| = 2, an orbit is called neutrally stable or marginally stable; this is 
the case for all families of degenerate orbits occurring in (integrable) sys- 
tems with continuous symmetries. It is easily seen that the quantity under 
the root in the denominator of the trace formula (3.8) for H = 2 becomes 
|det (Mpo — 1)1 = |2 — trM|, so that the formula diverges for all contri- 
butions from marginally stable orbit families. The other situation where 
the same divergence occurs is, as already mentioned in Section 3.2, when 
tr M = +2 for an isolated orbit. This happens in connection with orbit 
bifurcations which we will discuss briefly in Section 3.4. 

Using the stability matrix we can now give a practical recipe for finding 
periodic orbits of a Hamiltonian system. The idea is to exploit 
equation (3.29) for a Newton-Raphson iteration procedure (see, e.g. [59]). 
For this, we have to calculate the stability matrix during our search for peri- 
odic orbits. This is done by solving simultaneously the Hamilton equations 
of motion (3.19) and the equation of motion for the matrizant X(t) given in 
equation (3.25). (Actually, to get rid of the two irrelevant degrees of free- 
dom along the orbit, one may transform immediately to the coordinates rj_, 
Pj^ orthogonal to the orbit, to obtain the reduced part X(t) which after one 
period T gives M = X(T); see [59] for the equations of motion for X(t)). 
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Let us introduce a short notation z = (rj_, pj^) for the orthogonal phase- 
space vector. Usually, the search of periodic orbits is done on a Poincare sur- 
face of section, on which small displacements are automatically transverse 
to the searched orbit. We now assume that we have found an approximate 
periodic solution with the approximate period T. If we are close enough to 
the fixed point z* corresponding to the exact periodic solution, the remain- 
ing error of our approximate first starting point zi will propagate according 
to equation (3.29). After one period, the starting point zi is mapped onto 
z(, so that we have moved over an interval Azi = ’zl^ — zi. The distances 
of the two points from the unknown fixed point z* are <5zi = zi — z* and 
<5z( = z']^ — z*. If (5zi and are sufficiently small, they obey the linear 
differential equation (3.29), so that 

bzl^ = M(5zi, (3.31) 

from which we get, using Azi = <5z( — (5zi, 

Azi = (M- l)(5zi. (3.32) 

This equation can be inverted to obtain the error (5zi of our starting point 

(5zi = (M- l)-iAzi, (3.33) 

which now can be used to correct the starting point to Z 2 = zi — <5zi. Iterat- 
ing in this way, the fixed point is usually found within very few steps. The 
only problem with this procedure is that for an orbit with two eigenvalues 
A = 1, the inversion of (M — 1) is not possible, giving exactly the same 
divergence that appears in the trace formula (3.8). In practice, this is not 
a big handicap; when working with a sufficient numerical accuracy, one can 
come sufficiently close to the point where tr M = 2. 

The Newton-Raphson iteration converges very fast, once one comes close 
enough to a periodic orbit (be., a fixed point). It is therefore possible to 
search periodic orbits just by scanning a given Poincare surface of section, 
either systematically in small steps, or just randomly. The practical expe- 
rience shows that, different from what one might expect at first thought, 
stable and unstable fixed points are found with equal probability. 

3.4 Uniform approximations 

A classical system that is neither integrable nor fully chaotic is usually called 
a “mixed system”, or a system with mixed classical dynamics. It is char- 
acteristic of such systems that periodic orbits change their stability upon 
variation of a smooth parameter. Such a parameter may be the energy, any 
parameter appearing in the potential V{v) {e.g., a deformation parameter). 
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or the strength of an externally applied electric or magnetic field. Let us 
call this parameter a. Whenever a pair of eigenvalues of the stability matrix 
for a periodic orbit has the value A = 1 {i.e., trM = 2 for a two-dimensional 
system) at a = ao, then the orbit undergoes a bifurcation. What typically 
happens at a bifurcation is that an orbit changes from stable to unstable 
(or vice versa), and hereby one or several new orbits are “born”. (Orbits 
may also just “touch” the value tr M = 2 at a = uq without changing their 
stability, or a pair of stable and unstable orbits may emerge for a > ao and 
not exist at all for a < uq). Infinitesimally close to the bifurcation point 
ao, the new “child” orbits have the same shape and action as the “parent” 
orbit (but usually a different Maslov index). As one changes a away from 
ao, the new orbits develop independently, and eventually change their sta- 
bility again at a new bifurcation point ai . Often, periodic orbits proliferate 
this way in chains of repeated bifurcations, leading to an increased degree 
of chaoticity (c/. the so-called Feigenbaum scenario [63,64] and [65]). 

This behaviour of periodic orbits in mixed systems makes the semiclassi- 
cal POT difficult, since the trace formulae diverge at each bifurcation point. 
Mathematically speaking, the divergence comes from the fact that one of the 
stationary phase integrations used in their derivation (see Sect. 3.2) breaks 
down. The remedy is, in principle, to expand the action integral in the 
exponent of the semiclassical Green function (3.4) up to higher order and 
then to do the corresponding integration. For instance, when integrating 
along a direction x, one may have to expand the exponent up to third order 
in (x — xo), where xq is the stationary point; the integral over x then would 
lead to an Airy function. More generally, one expands the action integral 
around the stationary point rg, po in phase space (corresponding to the 
bifurcation point uq) into so-called normal forms [66] and then performs the 
trace integrations (if possible, analytically) . This locally removes the singu- 
larity at a = ao, but it does not yet guarantee that the result can be used 
further away: when a is sufficiently different from og one wants to recover 
the original Gutzwiller amplitudes Apo- To guarantee this, more refined 
techniques are required, yielding so-called “uniform approximations” . Such 
approximations have been developed systematically for all generic types of 
orbit bifurcations by Sieber and Schomerus [67], and for pairs of interfering 
bifurcations by Schomerus [68]. We refer to their articles for further de- 
tails; the resulting trace formulae have finite amplitudes at the bifurcation 
points and go over into the standard Gutzwiller trace formula away from 
the bifurcations. 

Uniform approximations are also required when the change of a smooth 
parameter, or its being switched from zero to non-zero (or vice versa) leads 
to the breaking (or restoring) of a continuous symmetry. As we have dis- 
cussed in Section 3.2 after equation (3.10), the power of U contained in the 
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Gutzwiller amplitude Apo depends on the number / of continuous sym- 
metry parameters describing the degeneracy of orbit families. When / is 
changed by one or more units upon variation of a parameter, the original 
form of the amplitude becomes invalid and therefore diverges. Start, e.g., 
from a two-dimensional billiard with oval boundary, which is non-integrable 
(with / = 0) so that the Gutzwiller trace formula (3.8) applies. Now, when 
letting its shape go to a circle, tr M of all the isolated periodic orbits will 
approach the value +2. Thus the trace formula diverges in the limit where 
the circular U(l) symmetry is restored and the billiard becomes integrable 
(with / = 1). (The amplitudes there are all proportional to 

Such a symmetry breaking arises generically if we perturb an integrable 
Hamiltonian like in equation (3.13) by switching on the parameter e. For 
small values of e, the perturbative trace formula (3.14) solves this problem 
but, as already mentioned, for large values e its amplitudes do not go over 
to the correct Gutzwiller amplitudes of the perturbed system. The tech- 
niques used to achieve this are similar to those used to derive the uniform 
approximations for bifurcations. For the breaking of U(l) symmetry, a gen- 
eral uniform approximation has been derived by Tomsovic et al. [69], and 
uniform approximations for SU(2) and SO(3) breaking have been derived 
for some specific systems in [70]. 

Finally, we mention that in [71] an analytical trace formula has been 
given for the two-dimensional elliptic billiard in which - although it is in- 
tegrable! - an infinity of bifurcations occur that cumulate in the limit of 
zero eccentricity (z.e., the circular billiard), so that both bifurcations and 
symmetry breaking had to be overcome. 

3.5 Applications to metal clusters 

The examples given here for metal clusters will be short, since they have 
already been discussed extensively at various occasions [4,31,72]. 

3.5.1 Supershell structure of spherical alkali clusters 

Based upon the semiclassical trace formula derived by Balian and Bloch [49] 
for the oscillating level density of a spherical billiard, and extending the 
semiclassical POT analysis to Woods-Saxon type shell-model potentials for 
large clusters, Nishioka et al. [73] predicted that supershell structure should 
be experimentally observable in large metal clusters. This was, indeed, 
verified in the experiments of the Gopenhagen-Stuttgart collaboration [74]. 
Figure 7 shows the oscillating part of the cluster abundance in an adia- 
batic expansion source. Shown are the statistically averaged fist differences 
of the logarithmic yields In, multiplied by a factor that accounts for the 
temperature suppression effects (see [30, 75] for a theoretical investigation 
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Fig. 7. Experimental observation of the supershells. Shown are the logarith- 
mically derived relative yields of sodium clusters from an adiabatic expansion 
source, scaled by an extra factor exp(cA^'^®) that takes account of the finite 
temperature effects. (After [74]). 



of these effects in finite-temperature Kohn-Sham calculations for canonical 
ensembles of spherical jellium clusters). 

The negative peaks in Figure 7 correspond to the most stable clusters 
which are understood as those with filled spherical electronic shells with the 
“magic numbers” indicated. Note that they appear at equal distances on the 
scale which is proportional to the cluster radius. The constant spacing 
s = is experimentally found to be s = 0.61 ± 0.01, also for other 

metal clusters (Li, Ga). The beating amplitude of the shell oscillations is 
understood in the POT from the interference of the shortest periodic orbits 
with largest amplitude, which here are the triangular and square orbits. 
Their average length determines the shell spacing s; from the trace formula 
of a spherical billiard [49] one finds the value s = 0.603 which is independent 
of the Wigner-Seitz parameter Vs of the metal. The calculations of Nishioka 
et al. yield s = 0.607; the same value is found in the jellium-model DFT 
calculations for spherical Na clusters [30]. (Some finer details in connection 
with the Ga experiments are discussed in [76]). The beat length, i.e., the 
period of the supershells, is given essentially by the difference between the 
lengths of the leading orbits. It does depend on the metal type (see a 
detailed investigation in [77]). 
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The effect of a magnetic field on the supershell structure was investi- 
gated semiclassically with the perturbative trace formula of equation (3.14) 
in [78]. The field strengths necessary to detect interesting effects are, how- 
ever, not experimentally available. (This is different for the mesoscopic sizes 
of quantum dots, as discussed further below) . 

3.5.2 Ground-state deformations 

As we know from nuclei, finite fermion systems may become deformed if 
a spherical shell is not completely filled: in order to avoid the degener- 
acy of the wavefunction, the system responds by a spontaneous breaking 
of the spherical symmetry. In molecular physics, this corresponds to the 
well-known Jahn-Teller effect. In the pioneering experiments on sodium 
clusters by the group of Knight [79, 80], the finer structures in the mass 
yields between the spherically magic numbers were, indeed, understood in 
terms of a deformed shell model (see [40] for details). Most of the smaller 
minima appearing in Figure 7 for cluster sizes up to ~ 800 could, in fact, 
also be attributed to deformed cluster shapes [81]. 

The POT analysis for deformed potentials becomes more complicated, as 
these usually are not integrable. Simple estimates of cluster deformations 
are, however, possible in terms of integrable deformed model potentials. 
This has been done successfully in nuclear physics by Strutinsky et al. [55]. 
Assume that only one periodic orbit, or one family of periodic orbits, is 
dominating the shell-correction energy 5E given by (3.12). The condition 
for finding its extrema then reduces to 

5Spo = 0. (3.34) 

This idea has been used for metal clusters in [82]. Figure 8 compares semi- 
classical estimates of the ground-state quadrupole deformations 5 with re- 
sults of quantum-mechanical calculations. For small clusters up to ~ 40, 
the selfconsistent Kohn-Sham field is well approximated by an axially de- 
formed harmonic oscillator whose exact trace formula is known analyti- 
cally [83]. In the limit of large clusters, the mean field has very steep walls, 
as demonstrated by the success of the spherical billiard model for the super- 
shells. For large deformed clusters it may therefore be approximated by a 
spheroidal billiard, whose approximate trace formula is also known analyt- 
ically [84]. The heavy lines in Figure 8 give the semiclassical estimates of 5 
using these exact trace formulae and equation (3.34). The thin lines are the 
quantum-mechanical results from KS calculations in the deformed jellium 
model [85] for N < 40, and for N > 40 from calculations using Strutinsky’s 
shell-correction method [2] in a phenomenological modified Nilsson model 
for large clusters that yields the spherical KS spectra [81]. 
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Fig. 8. Ground-state quadrupole deformations S of sodium clusters versus cluster 
size N. The solid thin lines for A < 40 are the results from KS calculations 
using the spheroidal jellium model [85], and for A > 40 those from microscopic 
Strutinsky calculations using a modified Nilsson model that is adjusted to the 
spherical jellium-KS results [81]. The heavy lines give the semiclassical estimates 
based on equation (3.34) using the trace formula of an axial harmonic oscillator 
for A < 26 and that of a spheroidal billiard for A > 40. (From [82]). 



More realistic deformed potentials have mixed classical dynamics, which 
leads to the problems discussed in Section 3.4 above. The POT analysis 
then depends on finding the most important short orbits and treating their 
bifurcations correctly. However, for the ground states of larger systems the 
deformations are usually not very large - as can be recognized, e.g., from 
the decreasing amplitudes of S seen in Figure 8. It is therefore possible to 
treat the deformed part of the potential in perturbation theory, using the 
trace formula (3.16) discussed above. For a slightly quadrupole-deformed 
billiard the modulation factor (3.15) can be calculated analytically [72,86]; 
hereby the unperturbed system is the spherical billiard. In Figure 9 we 
show by contour plots the shell-correction energy 6E as a, function of par- 
ticle number N and quadrupole deformation £ 2 - The agreement of the 
quantum-mechanical (left) and semiclassical results (right) demonstrates 
the quantitative predictive power of the semiclassical theory. 

3.6 Applications to two-dimensional electronic systems 

The recent developments in the fabrication of semiconductor devices on 
micro- and nanometer scales have opened up interesting new studies of the 
interplay of classical and quantum mechanics. Quantum dots and other 
nanostructures with any wanted geometry can be produced at scales where 
on one hand the coherence length of the electrons is larger than their confine- 
ment size, so that quantum interference takes place, but on the other hand 
their Fermi wave length is small enough so that high quantum numbers are 
involved and the classical limit is within reach. In addition, these structures 
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Fig. 9. Contour plots of the shell-correction energy SE for an axially quadrnpole- 
deformed cavity versus particle nnmber N and quadrupole deformation E 2 - 
Left: quantum- mechanical result, right: semiclassical result using the perturba- 
tive trace formula (3.16); the unperturbed system is the spherical billiard [49]. 
(From [86]). 



are of the order of, or bigger than, the magnetic length scales that can be 
reached experimentally. Hence, the strength of an applied external magnetic 
field represents a new degree of freedom for the observation of interesting 
quantum effects, such as the quantum Hall effect (QHE) and its various 
manifestations in connection with a laterally confined two-dimensional elec- 
tron gas (2DEG). 

In the following we shall give three applications of the POT to such 
mesoscopic systems, where the magnetic field plays an important role. In 
the first one, a circular quantum dot, the observed conductance oscillations 
could be directly related to the density of states of the confined electrons 
and hence be interpreted in terms of the trace formula (3.10). The other 
two examples concern open systems where the conductance is described 
within the Kubo formalism. The semiclassical description of such systems 
has been developed in [87, 88] . In the quantum-mechanical Kubo formula 
based on the linear response theory, the Green function is replaced by its 
semiclassical approximation (3.4). The oscillating parts of the longitudinal 
and transverse conductivity Sgxx and 5axy, respectively, with respect to an 
applied external voltage in the x direction (2 = 0 is the plane of the 2DEG) 
can then be expressed in terms of periodic orbits through the following 
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“semiclassical Kubo formulae” (including a spin degeneracy factor 2) : 
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(3.35) 




Here Spo, Opo, and Mpo have the same meaning as in the trace formulae 
presented in Section 3.2 above; all are evaluated at the Fermi energy Ey- 
A finite temperature T is included in the factor Rpo{Tf}) = {Tpolrf})! 
s\Vi]i{Tpo/Tf}) with the period Tpo = dSpo/dE and T /3 = h/{'KkT). Damp- 
ing due to a finite mean-free path is included by the factor Epo^Ts) = 
exp [— Tpo/(2rs)], where Tg = m*fj,fe is the scattering time extracted from 
the experimental mobility fx, and m* is the effective mass of the electrons. 
A is the effective area of the investigated device, and Cij are the velocity- 
velocity correlation functions of the periodic orbit, defined by 



C 



ij 





dr Ui(r) Vj{t + 



t) 



{i,j = x,y) 



(3.36) 



in terms of the velocities Vi{t) = pi{t)/m* of the periodic orbits. 



3.6.1 Conductance oscillations in a circular quantum dot 

In a circular quantum dot with approximately 1200 - 1500 confined elec- 
trons, the conductance was measured [89, 90] as a function of an applied 
external (perpendicular) magnetic field B and of the applied gate voltage 
Vgi that defined the effective radius of the confined 2DEG. Regular oscilla- 
tions of the conductance as a function of both Vgi and weak field strength 
B in the milli-Tesla domain were observed, as shown in the left-hand part 
of Figure 10. A semiclassical reproduction of these oscillations in terms of 
the trace formula of a circular billiard was given in [91] and is shown in the 
right-hand part of Figure 10. Both the experiment and the semiclassical 
analysis are well described in the quoted literature; we just summarize the 
essential conclusions here. 

Like in the large metal clusters, the selfconsistent mean field of this quan- 
tum dot is expected to have steep walls due to the strong mutual Coulomb 
repulsion of the trapped electrons. Hence, a two-dimensional circular bil- 
liard is a good first-order model. The period of the oscillation in the vertical 
direction, i.e., as a function of the dot radius R, is given exactly like for 
the spherical metal clusters by the average length of the leading periodic 
orbits. In contrast to the three-dimensional spherical billiard, all orbits in a 
two-dimensional circular billiard have the same one-dimensional degeneracy. 
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Fig. 10. Gray-scale plots of conductance of a circular quantum dot, plotted versus 
gate voltage (vertical axis) and magnetic field (horizontal axis) . Left: experimental 
result, right: semiclassical result using the circular billiard model including the 
magnetic field in first-order perturbation. (From [91]). 



and the leading orbit here is the diameter orbit. Using the average length of 
the diameter and the triangular orbits leads, indeed, to the correct period of 
the vertical oscillations seen in Figure 10. (Note, however, that the number 
of electrons and the radii R were only known approximately here). The 
oscillation as a function of a weak magnetic field B is well understood in 
perturbation theory where only the first-order corrections to the actions Spo 
are taken into account. The modulation factor (3.15) then becomes [91] 

Mpo = cos (^^FpoB'j , (3.37) 

where Fpo is the area included by the orbit po. The origin of this modu- 
lation factor is just the Aharonov-Bohm (AB) phase of a charged particle 
surrounding a magnetic field, i.e., the magnetic flux included by the elec- 
tron’s orbit. Here the phase adds to the classical action, the sign of the 
flux depending on the orientation of the orbit with respect to the B held. 
The summation over both directions yields the cosine factor in (3.37). The 
period of these oscillations for weak B - often called “AB oscillations” - 
are here essentially given by the shortest orbit with a non-zero area, i.e., by 
the triangular orbit. This example shows how through the introduction of a 
weak magnetic held, one is able to “measure” not only the average length, 
but independently also the average area of the leading periodic orbit (s). 

Similar AB oscillations have also been observed in a quantum dot with 
an equilateral triangular shape [92,93]. In the steep-wall limit, one may use 
a triangular billiard whose exact trace formula is known analytically [4] ; the 
period of the measured AB oscillations could, indeed, be understood by the 
area of the shortest periodic orbit of this system. A challenging speculation 
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Fig. 11. Semiclassical calculation of the transverse Hall resistivity pxy (left 
axis) and the longitudinal resistivity pxx (right axis) of the 2DEG in a transverse 
magnetic field B. pxx shows the typical 1/B-periodic Shubnikov-de Haas oscil- 
lations. pxy exhibits the plateaux corresponding to the integer QHE. Solid lines 
are obtained including only the lowest-order terms in the semiclassical Kubo for- 
mula (3.35). Dashed lines include the only h-correction to the first term of Soxy 
in (3.35). The inserts illustrate small variations of the mobility of the electrons 
(left) and of the temperature (right). (From [99]). 



was proposed for the study of a triangular quantum dot with open contacts 
and a small number of electrons [92,94]. Neglecting the Coulomb repulsion, 
the potential of the electrons then is given to lowest order by the famous 
Henon-Heiles potential [95] which has mixed classical dynamics. The semi- 
classical description of its level density using the Gutzwiller theory has been 
very successful (see [70] and earlier papers quoted therein). It would be in- 
teresting to observe the transition from regular to chaotic motion by tuning 
the Fermi energy of the electrons. This has, however, not been realized 
experimentally so far. 

Before turning to open nanostructures, we conclude this section on quan- 
tum dots by mentioning that for the disk billiard in arbitrarily strong ho- 
mogeneous magnetic fields, an analytical trace formula has been developed 
in [96] . Magnetic properties of quantum dots and dot arrays have also been 
investigated within the POT [97,98]. 
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3.6.2 Integer quantum Hall effect in the two-dimensional electron gas 

An unexpected semiclassical interpretation of the integer QHE was recently 
proposed by Blaschke [99]. In a typical Hall bar experiment, one measures 
the transverse (Hall) resistivity pxy and the longitudinal resistivity pxx with 
respect to an applied electric voltage in the x direction; the 2DEG is con- 
fined to the z = 0 plane. The elements of the resistivity tensor pij are 
given by inverting the conductivity tensor (jy, and their oscillating parts 
can thus be calculated semiclassically from the Kubo trace formulae (3.35). 
The periodic orbits of a 2DEG in a transverse magnetic field are simply 
the cyclotron orbits whose properties are easily calculated; their continuous 
degeneracy cancels the factor A. Adding the average resistivities, which are 
given within the classical Drude model, yields the results shown in Figure 11. 
The solid lines give the results obtained by keeping only the lowest-order 
terms in h of the trace formula (3.35). They correspond to the classical 
Hall resistivity pxy which is linear in B, and to the longitudinal resistivity 
Pxx exhibiting the typical Shubnikov-de Haas oscillations. (These are easily 
understood semiclassically in terms of the magnetic flux surrounded by the 
cyclotron orbits which is proportional to 1/H; see, e.g. [4]). To lowest order 
in U, the POT is not able to reproduce the plateaux in the Hall resistivity. 
However, when adding the (only! - see [99]) h correction coming into the 
term proportional to dSpo/dB in the trace formula (3.35) for Saxy (dashed 
lines), one obtains the plateaux in pxy characteristic of the integer QHE; 
the influence of this correction on pxx is very small. 

When the 2DEG is modulated laterally by an “antidot superlattice”, 
interesting commensurability peaks can be observed in the longitudinal re- 
sistivity, which correspond to electrons trapped on specific cyclotron orbits 
that fit around one or more antidots [100]. At low temperatures, small 
AB oscillations in pxx become observable and can be qualitatively repro- 
duced [101] by the interference of the most important periodic orbits via 
the trace formulae equation (3.35). (See [101] for a mini-review of the so- 
called “Weiss oscillations”.) 

3.6.3 Gonductance oscillations in a channel with antidots 

Our last example is a mesoscopic arrangement in which the electrons of 
a 2DEG are confined laterally to a channel of width ~1.0 /xm. Two an- 
tidots represent obstacles to the electric current through the channel; the 
effective radius of these antidots can be regulated by an applied gate volt- 
age Vg. Figure 12 shows an SEM photograph of the experimental gate 
structure [102]. The longitudinal conductance Gxx along the channel has 
been measured for various strengths of a perpendicular magnetic field B and 
gate voltages Vg [102,103]. Like in the case of antidot supperlatices [100], a 




M. Brack: Semiclassical Approaches 



201 




Fig. 12. SEM photograph of the gate structure of a mesoscopic channel with two 
antidots [102] (without contacts). Left: model potential used for the calculations. 
Center: typical periodic orbits encircling the antidots. (From [104]). 



commensurability minimum in the average conductance has been observed 
near those values of B for which a cyclotron orbit fits around the antidots. 

Small observed AB oscillations around the average part of Gxx can again 
be interpreted semiclassically [99, 104] as interferences between the leading 
periodic orbits (a few of them are shown in Fig. 12 by solid and dashed white 
lines) through the trace formulae (3.35). The result of the semiclassical cal- 
culation of 5Gxx is shown in Figure ??; hereby the parameters of the model 
potential used to describe the lateral confinement (including the antidots) 
have been optimized [99]. Like in the case of the Weiss oscillations [101], 
the amplitude had to be adjusted since the semiclassical Kubo formula is 
not able to reproduce quantitatively the amplitude of the oscillations. 

An interesting phenomenon is observed when varying both the magnetic 
field B and the gate voltage Vg, and plotting the loci of the oscillation 
maxima in SGxx- These arrange themselves, as seen in Figure ??a, along 
smooth lines whose slopes are well understood in terms of the B and Vg 
dependence of the actions Spo of the leading periodic orbits. However, some 
characteristic dislocations occur at apparently random places in the (H, V^) 
plane, as emphasized by the boxes. In the semiclassical analysis, they origi- 
nate from successive bifurcations of periodic orbits: the different orbit gen- 
erations lead to different slopes in Figure ??b, and these do not match 
near the loci in the {B, Vg) plane (shown for some leading orbits by gray- 
shaded thick lines) along which the bifurcations occur. Although the the- 
ory does not fit the experiment globally (at least 10 different orbit families 
contribute), the local agreement near the dislocations is excellent; see the 
box in Figure ??c. Thus, orbit bifurcations can be “seen” in the experiment! 
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Fig. 13. Oscillating part of the conductance 5Gxx (in arbitrary units) versus mag- 
netic field strength B (in Tesla). Solid line: experiment for Vg = —1.50 Volts [103]; 
dashed line: semiclassical result with optimized potential parameters [99]. 




Fig. 14. Maximum positions of SGxx versus B (vertical axes) and Vg (horizontal 
axes), (a) Experimental values [103]. (b) Semiclassical results [104]; s<j is the anti- 
dot radius regulated by Vg; approximately we have Sd oc V].; the gray-shaded lines 
correspond to the loci of bifurcations of some leading orbit families, (c) Behaviour 
near a dislocation (dots: experiment; lines: semiclassical results). (From [104]). 



4 Local-current approximation for linear response 

We shall in this final section present a semiclassical approach to the collec- 
tive excitation spectrum of a finite interacting fermion system. The semi- 
classical aspect here pertains to the calculation of the excitation 
energies, starting from a ground state that has been obtained either 
quantum- mechanically or semiclassically in the mean- field approximation. 
This approach is based on the use of generalized sum rules [105] and is 
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particularly well suited for the description of collective excitations such as 
giant resonances in nuclei (see [106] for a review) or plasmon resonances 
in metal clusters (see Sect. IV. C of the review article [31] on metal clus- 
ters). For the latter application, this approach has been developed first 
within the ETF framework [27] and later in connection with the Kohn- 
Sham formalism, where it was shown [107] to be an approximation to the 
random phase approximation (RPA) . Since it makes use of localized collec- 
tive current distributions, it was termed “local RPA” (LRPA, see also [108, 
109]). The method can, however, be derived [110] from the most general 
quantum-mechanical equations of motion that yield the exact excitation 
spectrum; the basic semiclassical assumption is, as previously, that the col- 
lective currents are given by local functions of r. We therefore propose [111] 
to term this approach the “local-current approximation” (LCA) which does 
not restrict its name to further approximations inherent in the RPA. 

We first state in Section 4.1 the quantum-mechanical equations of motion 
and relate them to a variational principle, and in Section 4.2 we sketch 
briefly the variational equations that are obtained if the basic excitation 
operators are assumed to be local functions of r. In Section 4.3 we give a 
practical way to solve these equations approximately in finite basis sets and 
give a physical interpretation of the LCA, and in the remaining parts we 
shall present some results of applications to metal clusters. 

4.1 Quantum-mechanical equations of motion 

In this and the next subsection we follow closely [110]. We consider a many- 
particle system defined by the Hamiltonian H = T V , where V includes 
external one-body potentials and the two-body interaction. (For simplicity, 
we omit in the following the “hat” symbol ” to indicate operators.) The 
exact ground state |0) and excited states \u) are given by the stationary 
Schrodinger equation 

H\u) = E,\u) = {Eo + noj,)\u). u = 0,l,2,... (4.1) 

It is a matter of standard many-body theory (see, e.g. [112-114]) to 
rewrite (4.1) in the form of the following equations of motion 





= fiu:AQ\0,Ol\Q) , 


(4.2) 


(o|a[i^,a]|o) 


= hu;,{Q\0,0^\Q) = Q , 


(4.3) 



where the operators Oi, and Oj, are defined as the creation and annihilation 
operators of the excited states: 

Ol\Q) = \u), a|zz) = |0), and a|0) = 0. 



(4.4) 
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The exact solutions of the equations (4.2, 4.3) are unknown for most sys- 
tems, as are those of the Schrodinger equation (4.1). Several approximative 
ways to find the excited energies TilVi, and/or their eigenstates have been de- 
veloped; they differ in the approximations used for the ground state |0) and 
the explicit form of the excitation operators O/. In the Tam~DancofF scheme 
one starts from the HF approximation, where the ground state |0) = |HF) 
is a Slater determinant, and the excitation operators are taken to be sim- 
ple one-particle-one-hole {Ip-lh) operators. In the small-amplitude limit of 
the time-dependent HF theory, the so-called random phase approximation 
(RPA) is obtained by admitting 2p-2h excitations in the ground state 



|0) = |RPA) = 1 + ^ |HF) , (4.5) 

y pp'hh' j 

and taking the excitation operators to be linear combinations of \p-lh ex- 
citations of the form: 

^^(RPA) ^ ^ . (4.6) 

ph 



In the above definitions, a) and a are the single-particle creation and an- 
nihilation operators for particle states (with ep,£p' > Ap) and hole states 
(with eh,Sh' < Ep), respectively, with respect to the HF ground state. We 
refer again to the literature for the details of these approximations [114]. 

For the following it is now essential that a variational principle can be 
formulated [110] which is exactly equivalent to solving the general equations 
of motion (4.2, 4.3). For that we introduce a Hermitian operator Q that 
can be interpreted as a generalized coordinate. To find the lowest excited 
state, one has to solve the variational equation 



SEs[Q] 

SQ 



(4.7) 



where E^IQ] is defined by 



E^IQ] = 




(4.8) 



and the “moments” mi and m3 (see the discussion of their physical sig- 
nificance in Sect. 4.3 below) are defined as expectation values of multiple 
commutators 

= i(0|[Q,[i7,g]]|0), 

= ^{0\[[H,Q],[[H,Q],H]]\0}- 



mi[Q] 
m 3 [Q] 



(4.9) 

(4.10) 
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The minimum value of after variation gives the first excitation en- 
ergy Tilv\. The corresponding operator Q\ is written as the linear com- 
bination 

Qio^O\ + Oi (4.11) 

of the creation and annihilation operators for the first excited state. The 
second excitation with energy T 1 UO 2 can then be obtained from variation of 
Q in an operator space that has been orthogonalized to Q\, and so on. In 
this way the whole excitation spectrum TilVi, can be successively constructed 
(see [110,111] for the details). 

In principle, the exact excited states and excitation energies of the sys- 
tem are given by the solutions of the variational equation (4.7). For practical 
calculations, however, one is forced to make some ansatz for the generally 
non-local operator Q. If it is taken from the space of particle-hole op- 
erators of the form (4.6), then equation (4.7) leads precisely to the RPA 
equations [107]. (Note that the RPA, in a consistent use within the DFT, 
includes the exchange-correlations effects. In the condensed matter litera- 
ture, however, the corresponding approach is more often referred to as the 
“time-dependent LDA” or TDLDA approach, whereas the name RPA is 
reserved to an approximation which ignores the exchange-correlations ef- 
fects. Nuclear physicists usually include the exact exchange in their RPA 
treatment, starting from the HF ground state as implied in equation (4.5) 
above). Even in this approximation, the equations of motion are very diffi- 
cult to solve for clusters without any spatial symmetry, e.g., including the 
ionic structure. In the following we shall discuss a semiclassical approach 
which makes use of a local approximation to the operator Q and has been 
successfully used for collective excitations both in nuclear [106] and cluster 
physics. 

4.2 Variational equation for the local current density 

Starting from the exact equations of motion defined by the variational prin- 
ciple equations (4.7-4.10), we now make the following two approximations. 
First, we write the ground-state energy as usual in DFT as a functional of 
the local density, E = A[p(r)] as in (2.19), using a standard approximation 
for the exchange-correlation functional {e.g., the LDA). Second, we assume 
Q to be a local function ofr: 



Q = Q{r). (4.12) 

Then, for an electronic system where the external potentials are local and 
the two-body interaction is just the bare Coulomb interaction, the commu- 
tator [E{, Q] is only given by its kinetic part {i.e., the potential part in El 
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commutes with Q): 

[H,Q] = [T,g] = i(V-u) + u-V, (4.13) 

where ^ 

u(r) = -— Vg(r) (4.14) 

m 

is a local velocity field, as will be discussed in Section 4.3 below. (Note 
that only the gradient of the velocity potential Q{r) enters the following 
development.) The moments (4.9, 4.10) are then functionals of the local 
density p(r) and the velocity field u(r). The moment nii is easily found 
to be 

mi [g] = mi [u] = ^ 

The moment m3 [u] becomes more complicated but can be readily calculated 
from p(r), u(r), their derivatives, and from the KS wavefunctions, using 
the current-density functional theory [115]. (See [110] for the most general 
expressions for ma[u]; special cases within the ETF model have been given 
in [27].) Performing the variation (4.7) then leads to the following eigenvalue 
equations for the excitation energy hto: 

= (M^^PWui(r) = TOu;2p(r)uj(r) {j = x,y,z) (4.16) 

(see [110,111] for the explicit form of the left-hand side above). The solution 
of these equations yields the lowest excitation energy huj = hoji and the 
corresponding velocity field Ui(r). As sketched above, the second excitation 
energy Tllo2 is then found by restricting the solution u(r) of (4.16) to be 
orthogonal to Ui(r), and so on. 

Due to the complexity of ms[u], equations (4.16) represent a set of 
coupled nonlinear fourth-order partial differential equations for the three 
spatial components of u(r), which are close in spirit to the variational 
equations [116] derived from the so-called fluid dynamical approach [117, 
118] developed in nuclear physics. Different from the approximations used 
there, however, equations (4.16) can be solved using the exact quantum- 
mechanical KS wavefunctions of the ground state. The semiclassical ap- 
proximation implied by (4.12) then affects only the collective excitation 
mechanism. 

Equations (4.16) are still quite formidable to solve numerically without 
further restrictions on the form of the velocity fields u(r). (For some first 
attempts, see [110] and [111]). An appreciable simplification arises when 
u(r) or, equivalently, the operator g(r), is expanded on a finite basis of 
local functions, as discussed in the following section. 
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4.3 Secular equation using a finite basis 

We now expand the local excitation operator Q{r) in a finite set of basis 
functions Q„(r): 



g(r)G{Q„(r)} n = l,2,...,M. (4.17) 

The variational equation (4.7) then becomes simply a secular equation for 
coupled harmonic vibrations generated by these operators: 

dei\lCmn-<jjlBmn\ = Q, m,n = 1,2, M (4-18) 

yielding the collective excitation energies HlUi, that represent the finite-basis 
LCA to the exact spectrum. Here the mass tensor Bmn and the restoring 
force tensor ICmn are given by 

B,nn = h^{0\[Qra,[H,Qn]]\0), (4.19) 

ICmn = {0\[[H,Qm],[[H,Qn],H]]\0), (4.20) 

which, apart from some reshuffling of constants, are the nondiagonal gen- 
eralizations of the moments mi and m 3 defined in equations (4.9, 4.10). 
After solving equation (4.18) one finds the intrinsic local excitation opera- 
tors Qi/(r), which create the states \v), as linear combinations of the basis 
operators Q„(r): 

M 

QAr) = J2CQn{r). (4.21) 

n—1 

The physical meaning (see also [27,106,107]) of the operators Q,y(r) shall 
in the following be illustrated in the so-called “scaling approach” [105,106]. 
This is the simple case where a collective excitation is created by one sin- 
gle operator Q(r), i.e., using M = 1 in (4.17) and (4.18). The collective 
flow of the particles can be described by a velocity field Vc((r,t) which is 
proportional to u(r) 

Va(r,t) = d(t) u(r) , (4.22) 

so that Q(r) (apart from a constant factor) plays the role of a velocity poten- 
tial The collective coordinate a(t), which undergoes harmonic oscillations 

a{t) = Qfosin(wt), (4.23) 

defines the time dependence of the single-particle wavefunctions by 

ipi{r, t) = ifi{v, a{t)) = 



(4.24) 
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where the “scaling operator” S is defined by 

5= [T,g] = i(V-u) + u- V. (4.25) 

Taking the time derivative of (4.24), the time dependent density 

Pa{r,t) = p{r,a{t)) ^ (4.26) 

Si<Ep 

is immediately seen to fulfill the continuity equation 

^p„(r,t) + V • j„(r,t) = 0, (4.27) 

where 

ja{r,t) = pa{r,t)va{r,t) (4.28) 

is the local current distribution describing the collective flow of the excited 
system. This is the reason why we call the present scheme the “local- 
current approximation” (LCA) - in contrast to the RPA scheme where the 
currents are implicitly given by nonlocal particle-hole operators. The mass 
parameter B is now given by (c/. (4.15)) 

B = m j u(r) • u(r) p(r) d^r, (4.29) 

and the restoring force parameter /C is obtained from the “scaled” ground- 
state energy {a\H\a) = E[p{v,a)\ according to 

JC = (0|[5,[^,i7]]|0) = [^(0|e“^i7e-“^|0) 

J a^O 

= ■ ("^- 30 ) 

J a=0 J a=0 

Since A[p(r, a)] on the right-hand side can be considered as the potential 
energy V (a) of a collective Hamiltonian 

i7coii(a, d) = \b 6? + V{a), (4.31) 

we see that the solution of equatoin (4.18) in this one-mode picture is just 
the first excitation energy of (4.31) in the harmonic approximation: Hlu = 
EsiQ) = \j!C/B. If one identifies the intrinsic excitation operator Qiy) with 
the external operator Qext by which the system is probed, this corresponds 
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exactly to the simple sum rule picture [105] (or, in solid state physics, the 
so-called plasmon pole approximation). 

The extension to the LCA scheme with several basis operators (4.17) is 
now evident: one couples several harmonic oscillators, each corresponding 
to one individual collective degree of freedom (t) , described by a velocity 
potential Qn(r), and diagonalizes this system via equation (4.18). This 
corresponds to a collective Hamiltonian with M coupled degrees of freedom, 
which in the harmonic approximation becomes 



1 



M 

HcoW = 2 

m.n—l 






mn 



1C 



mn ^m^n 



)■ 



(4.32) 



We now use this method to calculate the response of the system to an 
external field, characterized by an operator Qpxt- The strength function 
-S'Qext i.E) is defined by 



Sq.^E) = 5{E - hu,). (4.33) 



Hereby the spectrum {\v),huj,^} may be the exact one given by 
equations (4. 2-4. 4), or that obtained in any approximation. From the 
strength function one defines the fc-th energy- weighted moments mk{Qe-xt)' 



(^ext) 



POO 

/ E>^SQ^^,{E)dE = 

Jo 



^(?iO..)'=|(^z|gext|0)|^ 



(4.34) 



which can be related to experimental observables. For instance, the pho- 
toabsorption cross section (j(w) is in the long-wavelength limit given by 

a{uj) = Sd{E = huj), (4.35) 

3c 

where 

A> = Qext = ez (4.36) 

is the electric dipole operator and lu the frequency of the external electric 
field (assumed to be polarized in the z-direction) . The above moments are 
then simply given as the following integrals over the cross section: 



m\{D) = J a{uj)duj, 1713 ( 0 ) = j uj'^a{uj) du>. (4.37) 



The moment mi(D) can be calculated immediately and yields the model- 
independent Thomas-Reiche-Kuhn sum rule (in atomic physics called the 
“/-sum rule”): 



mi(D) 



= J a(uj) duu = 2 tt'^ 




— 

me 



(4.38) 




210 



Atomic Clusters and Nanoparticles 



where Z is the number of electrons taking part in the collective excitation. 
The experimental verification of the sum rule (4.38) thus helps to identify 
the collective nature of a resonance. The observed plasmon resonances in 
alkali clusters typically account for ~ 60 - 80% of the total dipole strength 
(see, e.g., Sect. VIII of [40]). 

The moment m_i(Qext) can be shown [119,120] to be proportional to 
the static polarizability Opoi of the ground state with respect to the external 
field t^ext 

i(f?ext) — 2 (4.39) 

This gives a convenient way to calculate static polarizabilities once the col- 
lective spectrum of a system is known. 

By construction, the spectrum {\v),huj,^} obtained by solving the LCA 
secular equations (4.18) fulfills the sum rules mi{Qext) and m 3 (Qext) ex- 
actly, if the operator Qext is contained in the basis {Q„(r)}. (This is, of 
course, trivial in the one-mode picture discussed above, where the only lo- 
cal operator Q{r) is taken to be the external excitation operator.) It is 
now a matter of physical intuition to guess the form of the intrinsic exci- 
tation operators of a given system by making a suitable choice of the local 
basis {Q„(r)}. In connection with the optic response, it is obvious that 
the dipole operator D (4.36) should be contained in this basis. Suitable 
basis sets for applications to metal clusters will be given in the following 
subsection. 

4.4 Applications to metal clusters 

For applications of the finite-basis LCA to metal clusters using the jellium 
model, the following set of local operators in polar coordinates (r, 9, (f>) has 
proven to be very efficient and simple to use [27, 107]: 

QL(r) = {Yim p )] , (4.40) 

where Yim{9^<j)) are the spherical harmonics in polar coordinates {r,9,(j)); 
m = 0 and m = ±1 correspond to the different polarizations of the exter- 
nal dipole field. For spherical clusters, where the angular momentum I is 
a conserved quantum number, a set of four to eight basis operators with 
I = 1 and p = 1,3,5,... or p = 1, 4, 7, . . . are often sufficient to get a good 
estimate of the lowest states of the collective dipole spectrum, in particular 
in the jellium model. For deformed clusters, several values of I must, of 
course, be included in the basis (see, e.g. [121] for calculations in the de- 
formed jellium model). In calculations where the ionic structure is taken 
into account [122, 123], the following set of operators has been added to the 
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basis set (4.40): 

gL(r) = -ejiikr) i {i) , ^) + {^ , ^)] , (4.41) 

where ji{x) are the spherical Bessel functions. We shall in the following 
present some results obtained with these basis sets. 

4.4.1 Optic response in the jellium model 

As a first test of the LCA, we show in Figure 15 the collective spectrum 
obtained in [107] for the spherical neutral clusters Nag and Na 2 o, calculated 
in the spherical jellium model by solving the KS equation in LDA. The 
upper parts give the results of a fully microscopical RPA calculation [124]; 
the lower parts give the results of the LCA using the basis (4.40) . The main 
peak is split in the RPA result for Na 2 o; this is understood in terms of an 
interference of the collective oscillation with a specific Iplh excitation [124] 
(so-called “Landau fragmentation”). It remains to be seen if a solution of 
the unrestricted LCA variational equations (4.16) is able to reproduce this 
splitting [111]. Apart from this, the agreement between RPA and LCA is 
excellent; in particular, the m 3 and mi dipole sum rules exhausted by the 
main peak(s) below 3.2 eV agree within less than one percent. 

Figure 16 shows static electric dipole polarizabilities of sodium clusters, 
plotted versus the cluster radii Ri. Black dots with error bars are the 
experimental values; the other symbols are results of spherical jellium-model 
calculations. The square boxes were obtained in the TDLDA [29,125,126]. 
They are perfectly well reproduced by the LCA calculations of [107] via 
the m_i sum rule relation (4.39), using the operator basis (4.40) and the 
full KS approach for the ground state. When the latter is replaced by a 
variational ETF calculation, the average results shown by the dashed line 
are obtained [27]. 

The main discrepancy between theory and experiment is here due to the 
lack of ionic structure in the jellium model. Inclusion of ionic structure gives 
a substantial improvement of the theoretical results [127,128], in particular 
if the finite temperature is also taken into account [129, 130]. 

4.4.2 Optic response with ionic structure 

We finally give some examples of LCA calculations that include the ionic 
structure. In the “cylindrically averaged pseudopotential scheme” 
(CAPS) [131], the ions are treated three-dimensionally by Monte-Carlo sim- 
ulated annealing using a local pseudopotential, but only the axially averaged 
total ionic potential is included into the DFT treatment of the electrons. 
This has the advantage that the KS equations only have to be solved on 
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Fig. 15. Collective spectra of the spherical clusters Nas and Na 2 o obtained in 
the jellium model. The positions of the vertical lines give the eigenenergies found 
by solving the secular equation (4.18), their heights give their percentage of the 
dipole mi sum rule. The ground state was obtained by solving the KS equations. 
Upper panels: RPA spectrum [124], lower panels: LCA spectrum [107]. 



a two-dimensional grid (with cylindrical symmetry), which allows to treat 
larger systems with an affordable numerical effort. Kiimmel [110] has con- 
structed a local “soft” pseudopotential that reproduces the correct bulk 
properties of sodium and at the same time the 3s energy level of the valence 
electron in the sodium atom (see also [123,132]). It should therefore be 
suitable for the use in clusters that interpolate from the atom to the bulk. 
Indeed, the ground-state structures obtained with this pseudopotential in 
CAPS calculations reproduce the experimental bond lengths in the smallest 
sodium clusters as well as all-electron ab initio calculations [110,127]. 

In Figure 17 (top part) we show the photoabsorption spectra obtained 
for Na 2 and Nas in CAPS calculations using the LCA for the collective ex- 
citation spectrum (note that the widths are phenomenological like in most 
TDLDA or RPA calculations). The positions of the main peaks repro- 
duce the experimental peaks (bottom part) very well. Also shown (middle 
part) is the result of a typical TDLDA calculation [133] using the Troullier- 
Martins pseudopotential, which yields a slight blue shift of the peaks due 
to a systematic underestimation of the bond lengths. 

Figure 18 shows the structures and collective dipole spectra obtained for 
the charged clusters Najy and Na|]^ using the same scheme. Again, the LCA 
calculations [122] yield an excellent reproduction of the experimental peaks 
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Fig. 16. Static electric dipole polarizabilities a of sodium clusters. Dots with 
error bars: experimental results [80]. Open squares: TDLDA results [29,125,126]. 
Solid line: LCA result using the m_i sum rule from the KS ground state [107]. 
Dashed line: LCA result using the m_i sum rule from the ETF ground state [27]. 
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Fig. 17. Photoabsorption cross section a (in arbitrary units) versus energy 
(in eV) for the neutral clusters Na 2 (left) and Nag (right). Top panels: LCA 
results [110,127]; center panels: TDLDA results [133]; bottom panels: experimental 
results [134]. (From [110]). 



in the photoabsorption cross section, including the high-energy shoulder 
seen in NaJ;^. 

We conclude by noting that an approximate icosahedral structure was 
found in the same set of CAPS calculations for the ground state of the Na^g 
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Collective model CAPS 





CAPS 




Fig. 18. Right: ionic structures of and NaJ,^ obtained in the CAPS [122]. 
Left: photoabsorption cross sections a in arbitrary units, plotted versus energy 
ineV. Upper parts, dots: experimental results [136]; lower parts: theoretical results 
obtained in the LCA [122]. 



cluster [123,132]; it has been confirmed (with only minor non-axial devi- 
ations) in DFT-LDA molecular dynamics calculations without symmetry 
restrictions [135]. 



I have profited substantially from my collaborators and students. Without their contri- 
butions and encouragement much of the material included in these lectures would not 
exist. I am particularly indebted to Joachim Blaschke and Stephan Kiimmel for allowing 
me to include some partially unpublished results of their Ph.D. work. Several parts of 
the work presented here were supported by the Deutsche Forschungsgemeinschaft. 
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1 Introduction 

Superconductivity has been discovered almost a century ago. The under- 
standing of the underlying mechanisms took approximately fifty years of 
efforts and required the input of several outstanding physicists such as 
London, Landau, Bogoliubov, Abrikosov, Bardeen, Cooper and Shrieffer 
to name only a few. In the mean time, the existence of the phenomenon 
has been established in several areas of physics [1,2]. Most often, the size 
of the system is much larger than that of the microscopic building block 
of superconductivity: the Cooper pair. For instance, this is the case for 
the aluminium, copper or nobium wires at very small temperatures which 
are widely used in magnets when high magnetic fields are needed. In the 
second half of this century, the long searched superconductivity of liquid 
Hes involving pairs with new spin properties was established for its two 
predicted phases. More recently, and somewhat unexpectedly, new families 
of materials have been discovered which display superconducting properties 
at temperature sometimes exceeding hundred Kelvin. Superconductivity is 
also conjectured to exist in neutron stars. In particular, it has been invoked 
to explain the glitches detected in the long term evolution of the frequency 
of some pulsars. 

At the other end of the length scale, many nuclear features are influ- 
enced by pairing correlations [3]. Those show up prominently when prop- 
erties of odd and even nuclei are compared. Among them, one can men- 
tion level densities near the ground state, binding energies, charge radii, 
moments of inertia, spontaneous alpha decay lifetimes and fission cross 
sections. As a matter of fact, the last property can be said to have led 
to the first application of (admitedly nature controled, rather than man 
made) superconductivity with a significant impact on the life of the aver- 
age citizen. Indeed, fissile nuclei in the thermal nuclear reactors (producing 
today approximately 15% of the world electricity) are always neutron-odd 
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(233u, 235pj^ 239py lesser degree ^'^^Pu) while fertile nuclei (233Th, 

23®U) are neutron-even. Although the even-odd proton character plays a 
smaller role in neutron induced fission, the dynamics which leads to the 
splitting of an heavy nucleus into two smaller fragments is favored for even 
Z elements such as Th, U, Pu and Cm as compared to Pa, Np and Am. 
More recently, with the advent of large size arrays of gamma detectors, the 
investigation of high spin properties has established the strong similarity 
between the reaction of a nucleus to a rotation field and that of a metallic 
superconductor to a magnetic field. In particular, the nuclear moments of 
inertia are very sensitive to growing angular momentum [4]. Although it 
was much more poorly documented as it is today, the relevance of the super- 
conductivity concepts to the understanding of nuclear structure was evident 
to many discerning physicists already fifty years ago. As a result, within 
a year of the publication of the BCS founding paper, Bohr, Mottelson and 
Pines [5-7] had given a description of the microscopic structure of nuclear 
Cooper pairs which is still valid today. 

On the other hand, nuclear dynamics is concerned with rather different 
conditions than those prevalent in studies of metallic superconductors. It 
takes place at zero or very small (compared to the pairing gap) excitation 
energy and more importantly in systems where a canonical description (as 
compared to grand canonical) is very much needed. For instance, it is clear 
that a meaningful discussion of odd versus even effects requires that experi- 
ment and/or formalism keeps a good ( albeit not necessary absolute) control 
of the number of particles in the system under consideration. In that re- 
spect, recent experimental breakthroughs in the production and control of 
superconducting microscopic metallic islands and aluminium grains at the 
nanometer scale have established a link between the nuclear and condensed 
matter perspectives as regards superconductivity [8-11]. For instance, the 
spectroscopy of tiny grains subjected to a magnetic field display features 
reminiscent of that of a rotating nuclei. These systems are also of great 
help to understand why and how odd-even staggering effects loose impor- 
tance as the system size and the temperature grow. We shall see that a 
characteristic of such small systems is that their dimensions become com- 
parable and even smaller than that of a Cooper pair. This makes impossible 
an observation of the effects connected with that spatial dependence of the 
phase of pairs which is known to be essential in the most spectacular man- 
ifestations of superconductivity in macroscopic devices. With nuclei and 
nanoscopic grains, we are considering a domain for which Anderson once 
coined the appelation “dirty supraconductors” [12]. In relation with them, 
he suggested that an interesting problem was to establish how small a sys- 
tem could be and still remain supraconducting. This question can be also 
reformulated in a more theoretical langage by asking the relevance and the 
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limitations for finite systems of the symmetry breaking concept which is 
basic to our present understanding of superconductivity. 

At the outset, I have to say that this lecture does not provide the an- 
swers to these important questions. Rather, in qualitative terms, it outlines 
the ingredients of the problem and use them to review a number of the 
approaches which can be used to study superconductivity in small systems. 
These notes also contain a summary of a recent discussion on the connec- 
tion between the odd-even staggerings of binding energies of nuclei and of 
atomic clusters, and the existence and the magnitude of pairing correlations. 
Finally, some lately published theoretical works for systems at zero and fi- 
nite temperature in relation with Anderson’s question will also be briefly 
described. My hope is that despite its limited scope, this text will give the 
student some insight into the range of methods which have been devised to 
investigate superconductivity concepts for finite systems and help him find 
which one is the most appropriate for a description of the pair correlated 
object he might be interested to analyze. 

A realistic description of physical systems often requires complex effec- 
tive Hamiltonians and much more elaborate methods than those which are 
presented here. For instance, rather sophisticated Hamiltonians are essen- 
tial to obtain an accurate description of nuclear properties. As a matter 
of fact, the analysis of the influence of pairing correlations on the high an- 
gular momentum behavior of nuclei is one of the most useful tool for the 
determination of the spin, isospin and spatial dependence of the nuclear 
two-body effective interaction [13]. Nevertheless, for pedagogical reasons, 
these notes use a very schematic dynamics which mimics only those basic 
features of nuclear or condensed matter physics which are essential for my 
purpose. For instance, the mean-field part of the Hamiltonian is simplified 
to a diagonal one-body operator. In addition, the lectures describe the effect 
of pairing correlations by means of the simplest two-body potential which 
retains the features necessary to start an investigation of superconductivity. 
For the same reasons, the formalism is kept at the simplest possible level 
so that formal calculations involve only two by two matrices and no refined 
mathematics. By doing so, I have kept the size of the presented material 
within limits compatible with the teaching schedule and, may be, have set 
up a workframe such that students can easily perform for themselves the 
calculations required to cover the intermediate steps skipped in the notes. 



2 Basic mechanism: Cooper pair and condensation 

For the sake of simplicity, in these notes, we consider that the one particle 
subspace has a finite dimension fl. Nevertheless, when it is convenient, the 
sums are replaced by integrals and vice versa without worry on problems 
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which one might incur when a continuum instead of a discrete formulation 
is used. Sections 2 and 3 give a short and schematic presentation of the 
BCS ideas and formalism and establish our notations. 

2.1 Condensed matter perspective: Electron pairs 

Let us start with a reminder of the mechanism which leads to the formation 
of Cooper pairs [1,14]. At zero temperature T, we consider two electrons 
in orbitals above the spherical fermi sea of a system with Nq fermions. We 
take the kinetic energy as zero order hamiltonian Hq and we assume that 
the individual orbitals are described by the tensor product of a plane wave 
and a spin vector jcr = ±1/2). The energy of a single particle state with 
momentum p = fik \s 

Cfc = h^P/2m, (2.1.1) 



where m is the mass of the particle. 

Following Cooper’s argument, we want to prove that even the weakest 
attractive interaction between the two fermions will bind them into a pair. 
This is a remarkable feature since in free space two particles only form 
a pair when the attraction between them is strong enough to overcome 
the quantum fluctuations associated with the kinetic energy. An order of 
magnitude of the energy associated with such fluctuations is fi^/{2md’^) 
where d is the typical distance at which attraction is maximal. In particular, 
it is very large for short range interactions. 

From the wave-functions of the two particles, one can build a pair wave- 
function 4^1 2 which is translation invariant. Here, we will impose the total 
momentum to be zero since we are looking for the state of lowest energy. 
Then, the general form for 2 is 

4^1,2 CX ^ (2.1.2) 

|fc|>fcp 

where the are expansion coefficients still to be determined. The Pauli 
principle imposes that only unoccupied states (above the Fermi surface) 
are involved in the expansion. The symbol 5(5) stands for the total spin 
function of spin S. When 5 = 0 the state is spin antisymetric while it 
is spin symmetric when 5=1. Since dr 12 must be antisymmetric with 
respect to the exchange of both space and spin coordinates, when 5=0 
(2.1.2) becomes 



4'i,2 oc ^ /gcos[£(fi - r 2 )j ^(5 = 0). (2.1.3) 

\k\>k-p 
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For S' = 1, the cosine is replaced by a sine function which vanishes when 
fi = T 2 - Because it is short range, the effective attractive force between the 
electrons favors the cosine spatial dependence and therefore the total spin 
S = 0. Thus, the optimal ^ should be searched as a mixture of two-body 
states involving electrons of opposite momentum and spin. 

For a given potential V{\r\ — r 2 \), the Schroedinger equation which de- 
termines the energy E( 2 ) of the hypothesized pair is 

(S(2)-2c,)4-= ^ (2.1.4) 

|fc'|>fep 



where Vj p is the Fourier transform of the interaction. For the sake of 
simplicity let us take the extreme case of a zero range attractive force V (x 
—G5{ri — T 2 )- Then, Vg p = —G for any pair k, k' and equation (2.1.4) can 
be solved as 



h = G 



E/fc-> 
2cfc — E(2) 



(2.1.5) 



In equation (2.1.5), the sum over k' can be replaced by an integral over the 
kinetic energy wde where w denotes the density of (doubly degener- 

ated because a = ±1/2) levels at the Fermi surface. Note that to account 
for the schematic character of the two-body potential, one has introduced 
an energy cutoff A. In the limit when A is much larger than the expected 
binding energy of the pair 



2e-p — E( 2 ), (2.1.6) 

the solution of equation (2.1.5) is 



2 

E( 2 ) - 2ep = -2Ae w G . 



(2.1.7) 



It is obviously negative for any (positive) value of G and w. 

The expression (2.1.7) shows that that the Cooper mechanism depends 
on the existence of a fermi sea. Indeed, if( 2 ) vanishes as w goes to zero. For 
a set of Nq spin 1/2 fermions homogenously filling a volume V, the density 
at T = 0 is given by 



Nq fcp (2meF)^/^ 



( 2 . 1 . 8 ) 



where k^ and ep are the Fermi momentum and energy, respectively. As a 
consequence, the level density w = 1/2 {dNt^/de-p) is 



UNq mkp 

4ep 2h^ TT^ 



(2.1.9) 
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Thus w (and A( 2 ) — 2ep) tends to zero when Nq, or ep, or V vanish. 

Using equation (2.1.5), a calculation of the expansion coefficients /g 
can also be made. It shows that these coefficients are the largest for the 
orbitals at the Fermi surface and they vanish rapidly as Cfc increases. Such a 
concentration of the probabilities on orbitals near the Fermi surface is also 
an essential feature of the BCS approximation and all the other which are 
described below. 

2.2 Nuclear physics perspective: Two nucleons in a shell 

In most nuclei, the number of neutrons N is larger than the proton number Z 
and the two Fermi surfaces occur for shells with different quantum numbers. 
Then, from overlap wave-function considerations, pairing between identical 
particles is favored over neutron-proton pairing which is only expected to 
play a role in TV = Z nuclei. In these notes, we will thus only consider pairs 
made of identical particles. 

Nuclei are almost spherical objects. Therefore a natural description of 
individual neutron or proton states involves spherical shells which are la- 
beled by their orbital quantum numbers I, their spin j and a radial quantum 
number n. The degeneracy of a spherical shell is 2j + 1. For two identical 
nucleons (protons or neutrons) in a such a shell, the number of configu- 
rations compatible with the Pauli principle is j{2j -1-1). Let us consider 
a zero range attractive interaction —VS(ri — Tz) between these two parti- 
cle [15]. Since it only acts in states of relative angular momentum I,- = 0, 
the total spin of the pair must be S' = 0 for the same reason as above. 
Then the total angular and orbital momentums are equal (J = L). The 
paired states can be organized in multiplets associated with even values of 
J ( J = 0, 2, 4, . . . , Jm = {2j — 1)). A lengthy, not particularly instructive 
in itself, and straightforward manipulation of the angular momentum alge- 
bra [16] provides the energy of the 2J-I-1 degenerate paired states associated 
with the total angular momentum J 

Ej = -G ^ ^ i i) ’ G=V drunij{r)\ (2.2.1) 

In this expression, there appears a Racah 3j symbol and an integral involv- 
ing the fourth power of the radial wave function Unij(r) of orbitals in the 
shell. 

For heavy nuclei, we can consider the limit j 1/2. Then, by approxi- 
mating the 3j symbol by its asymptotic expression one obtains 

Eo = -G (^j + , Ezc^-^ Gj , Ej (2.2.2) 
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The non degenerate J = 0 pair is thus approximately four times more bound 
than any other state. Moreover, the energy of the states vanishes as their 
angular momentum J grows. This analysis indicates that the privileged 
pair structure taking full advantage of the short range attractive nature of 
the effective nucleon nucleon interaction corresponds to the wave-function 

i 

'i’1,2 = Unlj{ri)Unlj{r2)'^{S = 0 ) (2.2.3) 

m— — l 

where (r^, 9i, 4>i) denotes the spherical coordinates of the particle i and 
are the spherical harmonics. This two-body wave function corresponds to 
a fully symmetric combination of pairs involving {(to, a), (—to, — ct)} pairs 
just as superconductivity in condensed matter involves the quantum num- 
bers k, a and —k, —a. 

2.3 Condensation of Cooper's pairs 

From now on, we shall work with the formalism of the second quantification 
and consider states as vectors in the Fock space. This space is built from a 
vacuum of particles |0) by means of the repeated action of a set of particle 
creation operators. These operators denoted aj, and o| are associated with 
some reference single particle basis. In practice, it is convenient to select 
the basis which diagonalizes the one-body operator corresponding to our 
reference Hamiltonian 



Hq — ^ ) €p (^GpClp -j~ CLpCLp} . (2.3.1) 

P 

In equation (2.3.1), for the sake of simplicity, the energies of both states 
a],|0) and o||0) are taken equal to Cp. The correspondance with Sections 2.1 
and 2.2 is thus 

{p, p} ^ |(fc,cr), (-fc,-cr)| ^ {(n,;,TO,cr), {n,l, -m, -a)}- (2.3.2) 

The degeneracy assumed in equation (2.3.1) holds in condensed matter (e^ 
only depends on |fc|) and in spherical nuclei. As a matter of fact, the levels in 
the mean-field of deformed (axial or triaxial) nuclei calculated for instance 
with the Hartree-Fock method also exhibits a twofold Kramers degeneracy. 
In experiments performed on small aluminium grains, Hq does not corre- 
spond to the plane wave Hamiltonian (2.1.1). The shape of the well in which 
electrons move is randomly determined by the process of congealation of the 
grain. Still each state should be spin degenerate. The degeneracy is only 
lifted by time-odd external operators such as a magnetic or a rotation field. 
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In such cases, a description of pairing requires a formalism which is only 
slightly more complicated than what is described below. 

From the above discussion, we are led to select a two body state built 
from elementary pairs {aj,a]j} and to define 



1^1.2) = ^^|0), (2.3.3) 

with 

-§t = ^ —44- (2.3.4) 

tip 

P ^ 

Our notation for the coefficients entering the definition of anticipates that 
of the BCS formalism and introduces coefficients Up and Vp which satisfy 
the relation 

4 + 4 = 1. (2.3.5) 

Based on the intuition that a fermion pair should in first approximation 
behave as a single boson, people have tried to approach the exact Ag-body 
wave-function by a condensate of such pairs, that is I'hjVo) oc (5'^)^”/^|0). In 
practice, because calculations with are difficult to handle, Bardeen, 

Cooper and Schrieffer introduced an approximation which retains the un- 
derlying physics and, at the same time, allows the use of the well established 
and efficient set of mean-field methods [17]. The BCS wave-function that 
they proposed in order to describe a superconducting system at T = 0, is 

l^'pcs]) cx exp(,S'l')|0) oc + Up44) 1^)' (2.3.6) 

P 

In contrast with (5'l')^°/^|0), the BCS approximation violates the particle 
number symmetry and [dr [3^3] ) does not have a well defined particle number 
Nq. One must therefore constrain the coefficients Up and Vp to satisfy the 
equation 

(|'I'[BCS]I-^I'I'[BCS]^ = -^0, (2.3.7) 

where N is the particle number operator 

N = ^ (a^pCip + 4®p) ■ (2.3.8) 

p 

3 Mean-field approach at finite temperature 

Much of the information on small superconducting systems has been ob- 
tained at very small temperatures well below the critical temperature. Stric- 
tly speaking, on the earth (as opposed to stars), nuclei are observed at zero 
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temperature. The description of their excitation properties by means of 
a formalism involving the concept of temperature, is always an approxi- 
mation whose validity depends very much on the experimental conditions. 
Nevertheless, temperature is a useful tool whenever experiments only access 
properties averaged over a fraction or all of the excitation spectrum. More- 
over, understanding temperature effects in superconducting finite systems 
has a relevance in connection with Anderson’s question and is certainly use- 
ful for condensed matter physics. Therefore, I have chosen to work within 
a finite temperature formalism. This section begins with a discussion of 
the BCS theory and introduces notations and techniques employed in the 
methods which are proposed later to improve the description of pairing in 
finite systems. 

3.1 Family of basic operators 

At finite temperature, physical systems are described by their density oper- 
ator D which contains all the information on their dynamics and accounts 
for thermal statistics averaging effects. The BCS theory seeks to determine 
an approximation of D which gives a good prediction of the most important 
properties associated with superconductivity. 

3.1.1 Duplicated representation 

Because BCS breaks the particle number symmetry, it is convenient to con- 
sider a set of Ft/ 2 subspaces of dimension four, which take into account both 
the creation and the destruction of particles associated with pairs {p,p). 
Thus, we introduce the following column and line vectors 

Up ^ 

! , 4 “p “p ) > (3.1.1) 

aj, ^ ^ 

4 / 

whose components satisfy generalized fermion anticommutation relations 
with 

/ 0 0 1 0 \ 

0 0 0 1 

10 0 0 
\ 0 1 0 0 / 




cr = 



(3.1.3) 
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The matrix cr (not to be confused with the spin magnetic component used 
above) satisfies = 1 and <7^ = a where the letter T stands for matrix 
transposition. 



3.1.2 Basic operators 

Let us now consider a family of operators acting in the Fock space which we 
will use to construct various approximations of the density operator. The 
operator which is the representative of this family is defined in terms 
of a c- number and a set of p 4 x 4 matrices as 

f [^1 = e P . (3.1.4) 



The anticommutation properties (3.1.2) impose that satisfies 

—Cp^‘^ . In addition, one often requests that and therefore Cp^‘^ be 
hermitian. These conditions restrict the number of degrees of degrees of 
freedom in Cp^'^ to six real parameters. 

For the simplest form of the BCS theory, one can work with a parame- 
trization which relies on only two real parameters {lp^‘^\ and take Lp^'^'^ 
equal to 



1 


/ Cpt^i 


0 


0 


C [d] 
Dp 


II 


0 


[rf] 

l_xp 


c [d] 
Dp 


0 


0 


C [d] 
Dp 


n [d] 

Op 


0 




c [t^] 
y Dp 


0 


0 


n [d] 

Op 


where the coefficients Cp^'^^ and 


are related to 



(3.1.5) 



Cp^d] ^ cos(t»M) , S'p[^] = sin(4^1) . (3.1.6) 



Below, we will also need a simple extension of this parametrization involving 
another angle (pp with the replacement 



5p[^l ^ ^p[‘^lexp(ip[fl) (3.1.7) 

in the upper right 2x2 matrix in equation (3.1.5). The coefficients in the 
lower left submatrix are then adjusted accordingly when the hermiticity of 
£pt'^^ must be preserved. 
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The other degrees of freedom, which we will not consider in this text, 
are: 

• when the p, p degeneracy is lifted (a situation evoked in Sect. 2.3) it 
is necessary to introduce two distinct coefficients and 

• the last two degrees of freedom describe the mixing within the p,p 
subspaces. They do not bring any new physical information at the 
level of this simple version of the BCS approximation (they correspond 
to a rotation in the two dimensional degenerate subspace and therefore 
a redefinition of states p and p). On the other hand, they become 
interesting when extensions of BCS are considered (z.e. Hartree-Fock~ 
Bogoliubov) [18]. 

The form (3.1.5) of introduces a decoupling which allows one to only 
work with 2x2 matrices: 





R. 

II 


- 


^;p[‘^l(at ap- )Z^p[^l 


P 






V 


with 


II 




5pM \ 




V 5p[^l 


-Cpi^i j ■ 



Note that the matrix Up^'^^ is unitary: Up^'^^Up^'^^^ = 1. 



ap \ 

4 7 ’ 

(3.1.8) 

(3.1.9) 



3.1.3 BCS coefficients; quasi-particles 



The wave-function (2.3.6) is written in terms of standard BCS coefficients 
which are connected to Cp^‘^^ and by the relations 



— Up ^p 



5 ['"1 = 2 uMu [^ 1 , 



(3.1.10) 



so that equation (2.3.5) is verified. From Up and Up, we define a pair of 
quasiparticle creation operators and 



= Mp* ^ ap + a. 



t 

'p “p 



= u^p^ap - 



ap= uPtp 



Op Op 



(3.1.11) 



which satisfy fermion anticommutation properties. 

With the choice (3.1.11), the exponent of takes a diagonal form 

i-i.i'fl + 1: i/'] - Y. ) 

f[^l=e P ,e P . (3.1.12) 
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3.2 Wick theorem 

One of the interesting features of the operators is that calculations of 
expectation values of one, two, many body observables can be effected by 
means of the Wick theorem. The form (3.1.4) is one of the simplest having 
this property. Other operators allowing the usage of the Wick theorem are 
exponentials of more general quadratic forms of creation and destruction 
operators. A possible starting point to demonstrate the Wick theorem is 
the relation 

( M f M = TpMf ["1 f M , (3.2.1) 

V 4 / \ J 

where the 2x2 matrix Tp^‘^ is defined as 

= exp = cosh Ip - sinhlpl^lfipl^l (3.2.2) 

Hint: to prove equation (3.2.1), consider and as 

functions of Ip^^^, derive a differential equation and solve it. 

Any thermodynamic property can be obtained from the partition func- 
tion Tr D where the notation Tr indicates a trace taken over the Fock 
space. The evaluation of the quantity Tr is thus a prerequisite to any 
thermal analysis. Using the quasi-particle representation (3.1.12) one finds 

Trfl^l =exp(-lo'‘'')ndet(l-kTp['^l) . (3.2.3) 

P 



The Wick theorem states that the expectation values of an observable Q 
(g)[^l = Tr g) /Tr (3.2.4) 

can be expressed as a polynomial of the matrix elements of the 2x2 matrices 




Note that the last equality in (3.2.5) generalizes the usual form of the oc- 
cupation factor in Fermi statistics. Hint: use equation (3.2.1). 

For instance, for the particle number operator (2.3.8), one obtains 



= E + (l - («p4)‘"')] (3-2-6) 

P 
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For the energy of the superconducting system, we consider the simplest 
possible type of pairing Hamiltonian 

^ = ^0 - ^ Gpqalalagaq , (3.2.7) 

pq 



where Hq is defined in equation (2.3.1). The matrix elements of the pairing 
interaction satisfy Gpq = Gqp and (only justified by my desire to shorten 
the formulas in these lectures) Gpp = 0. Such an interaction scatters the 
pair from any fully occupied level q to any empty p level. 

In fact, instead of H , we will rather work with the shifted operator 

k = H -pN, (3.2.8) 



which allows one to treat with similar notations the canonical (/i = 0) or 
grand canonical (/r yf 0) situations. By means of the Wick theorem, we 
readily obtain 



(7^)M 






p) ^kpdp'^ -\- 1 



.pCfcp/ 



- XI K4) Kog) 

pq 



(3.2.9) 



Below, we will have to consider first order variations of quantities such as 
£{7^W} with respect to arbitrary variations of the matrices 5TZp^^\ From 
such a variation, the matrices 






Cp- p \ 

Ap ^ —€p -\- p J 



(3.2.10) 



naturally appear. In equation (3.2.10) the quantity A|f^ is the pairing gap 
associated with and H. The quantities A|f^ and are related to 
7^p'^^ by 



AM = -^Gp,(a,a,)M, A^ 

P 



X Gpq{alal)^^l 



(3.2.11) 



When is hermitian and A|f^ are equal. This is the case when 
is given by (3.1.5) (in addition A|f^ is real). 

With these definitions, the variation of the energy (3.2.9) takes a com- 
pact form 

= X 'H{n^^}snp^‘^^ (3.2.12) 

p 

where tr stands for the trace of the 2x2 matrice. 
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3.3 BCS finite temperature equations 

The grand canonical BCS theory relies on the simplest possible approxima- 
tion for the statistical density operator: 

p[BCS] = (3.3.1) 

where is optimal according to a standard variational principle of the 
thermodynamics. Within BCS, the parameters 0p^‘^} are those which 

minimize the approximate Gibbs free energy Fq calculated with where 
Fg is 

Fg = {K)-^S=-^IiiZg , (3.3.2) 

whith the entropy defined by 

S = -Tr {VlnV)/TrV + lnTr {V) (3.3.3) 

(Note: Eq. (3.3.3) takes into account that T> is not necessarily normalized.) 



3.3.1 Density operator, entropy, average particle number 



From (3.2.3), one obtains the expression of the partition function 



Tr p[BCS] ^ 



2 cosh 



Id]' 



(3.3.4) 



The density (3.2.5) is calculated explicitly as 



[<i] _ 

/V-n — 



(l - , 



i = tanh 



F 



(3.3.5) 



Then, one calculates the BCS entropy S'!®®®! and the necessary expectation 
values (Q)^bc® 1 = The results are 



S'[BCS] = ^ ^2 In ^2 cosh ^ j , (3.3.6) 

for the entropy, 

(iV)[BCS] ^ i:(i , (3.3.7) 

P 

for the particle number and 

(^)[BCS] =J2(ep- p) (l - tpI'^lCpl'^l) - i ^ , 

pq 



for the (shifted) energy. 



(3.3.8) 
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3.3.2 BCS equations 

We can now write from its definition (3.3.2). and express its sta- 

tionarity with respect to the parameters which define We note that, 
in the BCS formalism, the normalization factor does not play any role. 

Variation with respect to 

It yields the equation 

0 = (cp - ^ (3 3 9) 

where the gap defined by equation (3.2.11) is 

APCS] = i ^ . (3.3.10) 

After introducing the quasi particle energies 

ejfcs] ^ ^(ep-^)2 + (A|?cs])2 ^ (3 3,^1) 



one obtains an implicit solution of equation (3.3.9) for the quantities Cp 
and 



, [BCS] 

C- [d] ^ 

P [ROSl 



^ [d] _ Cp ~ M 
P [BCS] ■ 



Equation (3.3.10) can now be rewritten in the standard form of the finite 
temperature gap equation 



A [BCS] 
p 




A [BCS] 

f 1 

e^[BCS] ^9 



(3.3.13) 



Variation with respect to lp^‘^ 

It gives a second equation 

= (ep - + AlfCS]^^[d] ^ (3 3 ^^ 4 ) 

which by means of equation (3.3.12) shows that is equal to up 

to a temperature factor 

IpM =/3e|f°®l. (3.3.15) 

These results can be reformulated by means of the definition (3.2.10) and 
the equations (3.1.9) and (3.3.12). The matrix satisfies 

^{.^^[BCS]} ^ . 



(3.3.16) 
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An equivalent form is obtained by using equations (3.3.15, 3.2.2) and (3.2.5). 
The BCS density matrix is solution of the self-consistent equation 



7^pPcs] 



1 



(3.3.17) 



3.3.3 Discussion; problems for finite systems 

Many textbooks present and analyze the rich physical content of the BCS 
formalism in the context of condensed matter physics (see for 
instance [1,2]). Here, I only discuss some of its limitations when it is used 
for finite systems. These are related to the breaking of particle number 
symmetry which, for instance, makes an interpretation of the quasi particle 
energies somewhat ambiguous. 

For a discussion of this point, consider Figure 1. On the left side, the 
quasi particle spectrum is plotted (with a 90 degree tilt) versus the 

single particle energies Cp. Both spectra are discrete so that only the in- 
tersections of the solid curve with the short horizontal segments have to be 
considered. To investigate the meaning of e|f^, we first consider a situation 
with the pairing interaction turned off. The solid line is then replaced by 
the broken dotted line which is the union of the bisectors of two quadrants. 
On the right part of the figure we show that the upper dotted line contains 
information on the excitation energy of the state with N + 1 particles versus 
the state with N particles. Such excitations, can for instance, be measured 
as the Q factors of a pick-up reaction transfering one particle from a pro- 
jectile into the A^-particle target. On the other hand, the lower part of the 
dotted line tells about the excitation energies of the system with N —1 par- 
ticle (as obtained for instance by means of a stripping reaction) with respect 
to the A^-particle system. No part of the curve has any relation with the 
excitation spectrum in the A^-particle system (rightmost spectrum) which 
involves particle-hole excitations. Returning now from the dotted curve to 
the solid line, one sees that its physical meaning evolves continuously from 
top to bottom. When Cp fi, the curve tells about the system with N + 1 
particles. On the other hand, its lower part is more related to the N — 1 
system. Although it remains an interesting outcome of the theory to dis- 
cuss properties of odd systems, the interpretation of the curve for e « /r is 
not without ambiguity. This is in contrast with infinite systems, where its 
shape can be tested by means of an analysis of the density of conduction 
states [1]. 




H. Flocard: Pairing Correlations in Finite Fermionic Systems 



241 





E* 



, 0,1 



hole particle-hole 

N-l/N N/N 



Fig. 1. Interpretation of the BCS quasiparticle energy spectrum for finite systems. 
From left to right are plotted the quasiparticle energy spectrum Cp (horizontal 
axis) versus the discretized single particle energy tp (horizontal segments crossing 
the vertical axis). The three rightmost spectra correspond to the excitation energy 
of the N + 1, N—1 and N systems in the absence of two-body (pairing) interaction. 
In the three cases, the reference energy is the ground state energy of the system 
with N particles. 

3.3.4 Discussion; size of a Cooper pair 

There is another notion introduced in the discussion of an infinite super- 
conducting medium whose importance requires some reevaluation in the 
context of small systems: the correlation length, sometimes also called the 
size of the Cooper pair. This quantity ^ is the typical dimension of a zone 
such that two particles in the quantum state 4'i_2 can be simultaneously 
extracted from (destroyed in) the many-body state. The quantity which 
describes this process is 

K{r,-P) = (l>(F)4>(r))P®®l , (3.3.18) 

where 4)(r) (resp. $(r')) is the field destroying one particle of p type (resp. p) 
at point r (resp. r'). The expression of these fields is 

= X! = X! (3.3.19) 

p p 

where ^p(r) (resp. ^p(r) are the single particle functions associated with p 
(resp. p) state. In finite systems, K depends on the two coordinates r and 
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r' and its interpretation is difficult. On the other hand, in infinite systems, 
where because of translational invariance K depends only on |r — r^|, it is 
easier to define the correlation length Indeed, an analysis of the long 
distance behavior of the function K(0,f) shows that it is proportional to 
the product of an oscillating function sin(fcF|?^|)/|f1^^^ by an exponential 
function exp(— |r]/^). The range ^ of the exponential which governs the 
dominant term in the fall-off of K is precisely the correlation length. The 
same analysis provides its expression in terms of dynamical parameters 






kp 

mA 



hvp 



(3.3.20) 



where up = fikp/m is the speed of a particle at the Fermi surface. As 
a matter of fact, the formula (3.3.20) can also be found by means of the 
following heuristic argument due to Anderson. The energy variation SE 
associated with the destruction of a pair into two asymptotically free par- 
ticles is A = = h^kpSk/m = uf<5|p| = vph/6\f]. The quantity (5|r| can 

be assimilated to the size ^ of the Cooper pair 4'i^2- Note that ^ increases 
as A decreases. 

Let us now apply the formula (3.3.20) to finite superconducting systems. 

• In nuclei, the gap A is approximately equal to 1 MeV. The Fermi level 
being typically 40 to 50 MeV excited with respect to the bottom of 
the nuclear well, the Fermi velocity vp is about 0.3 the light velocity. 
We thus find that an order of magnitude for ^ is 50 fm; it is 5 times 
larger than the radius of the heaviest nucleus; 

• For a small spherical metallic grain, by means of equation (2.1.9), one 
can write the dimensionless ratio of the correlation length to the radius 
a of the grain as ^/a ~ 2.8 10^ w/(A a^) where w~^ and A are given in 
meV and a in nanometers. Typical values realized in experiments [10] 
are A = 0.2 meV, wA = 2 and a = 10 nm. In such a case, the ratio 
^/a exceeds hundred. The situation is even more extreme than in 
nuclear physics. 

As a conclusion of this section, one can say that in small systems such as 
nuclei and nanoscopic grains, the notion of correlation length looses rele- 
vance. It is the physical dimension of the system which sets the microscopic 
scale of the superconductivity phenomenon. 



3.4 Discussion: low temperature BCS properties 

In order to prepare comparisons with the methods exposed later, let us 
analyze the limiting behavior of the BCS theory when T = 1/P 0. Since 

the quasi particle energies are positive by definition, the limit of the thermal 




H. Flocard: Pairing Correlations in Finite Fermionic Systems 



243 



factors is 1 — 2exp(— /3ep ). In nuclei or very small aluminium grains, 
the temperature T can become smaller than any other energy scale. Then, 
an expansion of the gap equation (3.3.13) gives 



APCS] G ( 1 - 



-1 

Similarly, the entropy (3.3.6) becomes 



^[BCS]^2/3yelfCS]g-/3e[f^®\ 



(3.4.1) 



(3.4.2) 



For larger systems such as microscopic metallic islands, one may instead 
use the continuous limit of condensed matter physics. In this limit, T is 
smaller than all energy scales except the single particle energy spacing 1 /w 
(see Eq. (2.1.9)). In general, as in the discussion of Section 2.1 one also 
takes all matrix elements Gpq equal to the same value G over an energy 
range [— A/2, A/2] centered at the Fermi surface. We will use the BCS gap 
A at T = 0 as our reference pairing energy. This gap is the solution of the 
equation 



^ wde 

G J-A/2 + A2 



(3.4.3) 



The condensed matter low temperature limit corresponds to the conditions 






( 3 . 4 . 4 ) 



In such a situation, a useful expansion formula is provided by the approxi- 
mation 



g-/3\/e2 + A2 








(3.4.5) 



where i5(e) and S"{e) are the dirac distribution and its second derivative, 
respectively. By means of equation (3.4.5), the equations (3.4.1) and (3.4.2) 
are transformed into 



and 



A[bcs](^)^A 




(3.4.6) 



= 2u>A4/27r/3Ae“^^. (3.4.7) 

These formulas as well as equations (3.4.1) and (3.4.2) display an unsatis- 
factory feature of the BCS theory which restricts its application for small 
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superconducting systems: the temperature dependence is governed by the 
factor e~^^. As in the discussion on the interpretation of the quasi par- 
ticle energies, one finds that the BCS low temperature expansion formulas 
are appropriate neither for an even system for which the argument of the 
exponential should be — 2/3A, nor for an odd system for which the argument 
is expected to be close to — (3 / {2w‘^ A) . 



4 First attempt at particle number restoration 

4.1 Particle number projection 

Working in canonical formalism corrects the problems which in BCS are 
caused by the breaking of particle number symmetry. To this end one 
introduces the projection on the subspace with Nq particles 

1 . i,n(N - 

(4.1.1) 



Pno = ^ . 



This operator is hermitian {Pnq = Pno) satisfies P^^ = Pnq- From the 
expression (2.3.8) of N, one sees that the exponent within the integral can 
be written in a form similar to (3.1.8) 



iipN = iipfl + ip ( aj ) 



1 0 
0 -1 



A 



(4.1.2) 



A comparison with (3.1.6, 3.1.8) and (3.2.2) leads directly to the following 
correspondance table 

—ipPl 






I Id] 

Ip 

^ 0 . 



-up 



(4.1.3) 



If one introduces the following notation for the non hermitian operator 

flv] ^ ^ipN ^ (-4 4^ 

which satisfies the relation the projection can therefore be 

written 

Pno = 7T . (4.1.5) 

Jo 

The matrices Tp associated with are diagonal and p independent 



(4.1.6) 
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4.2 Projected density operator 

Let us define a projected mean- field density operator by 

1 p‘ 2 , 7 T •/// /\A 

= / / dip" d(fi' - P)Noflv"]fld]fl-v'] 

47t2 Jo Jo 



One notes that both sides of are multiplied by the projection because 
does not commute with N as soon as one of the d|f^’s is non zero. 
Using the group structure of the set of operators, one defines 

'flv",d,-ip'] ^ p[ip"]p[d]p[-ip'] ^ (4.2.2) 



The associated 7p matrices are also obtained by a multiplication law 

Tpl^"d,-v'] =Tp^v"]j-^[d]j-^[-v']_ (4 2.3) 

With the notation p = p" — p' , according to equation (3.2.3), the partition 
function can be written as 

Tr = At / [ dp" dp' 

47 t Jo Jo ( 4 . 2 . 4 ) 

xripdet(l + Tp[^"’^--‘"'l) . 

Switching from the variables {(f", (f') to ((^, (p'), one obtains a more compact 
expression 



Tr = — / dp p j^id.v) ^ (4 2 . 5 ) 

27 ^ Jo p 

with 

= 2 -I- coshlpt'^^ cosp — zsinhlp^'^^ Cp^'^^ sin(^^ (4.2.6) 

From equation (3.2.5) we obtain the generalized density matrices 

1 

(4.2.7) 



n, 













[ip",d,-ip'] 
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whose explicit expression is 



n, 






D 



(d,v) 



+ ^coshlpt'^1 — sinhZpf'^l e*'^ — sinh e**^ e^**^ 

sinh^pt^'l 5p['^l e-*‘^e-2*V?'l + (coshZpt^l+sinh^pI'^l CpI'^l) ^ 

(4.2.8) 



4.3 Expectation values 

For any function f{p) of the gauge angle tp, let us introduce the more 
compact notation 

((/(<p)))M ^ ^ d<^eI-^o‘"' + - ^o)] D^d.^) /(<^) , (4.3.1) 



so that, for instance, one has Tr = ((l))^'^] . With this notation, the ex- 

pectation value of an operator Q for the projected density operator (4.2.1) is 



(g)[^«i 



Tr X>[^olg 
Tr Vi^o] 



1 




p 2 tt 



dp' (Q) 



[ip+ip' ,d, — ip'] 




(4.3.2) 

The quantity {Q)\v+v ,d,-v\ being defined as in equation (3.2.4) can be 
calculated by means of the Wick theorem with the density matrix (4.2.8). 
In most cases, it turns out that the integral on p' in (4.3.2) can be performed 
trivially. 



For the calculation of the expectation value of the particle number 
,d,-v'] ^ use (3.2.6) and obtain 



No = (iV)l^ol = 



((1))M 



E2 



1 -I- (coshZpf'^l — sinh?p['^lCpt‘^^)e*‘f^ 



D 



{d,v) 



(4.3.3) 

It is important to note that (4.3.3) is neither an equation nor a constraint. It 
is an identity valid for any set Cp^'^^ as long as has a component 

with Nq particles 0). 
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For the average value 1 of the energy, we use (3.2.9). The 

projected energy is then obtained 

= /\-o, l + (cosh?pM-sinhl,Mc,M)e»‘^ 

^ ^ ((l))t^l \\V 4"’^^ 

2-^ P'1 T^{d,ip) p.{d,ip) 

pq ^p 

4.4 Projected BCS at T = 0, expectation values 

It would seem that we are in position to obtain all the contributions to the 
Gibbs free energy (3.3.2) . This is not the case. The direct approach based on 
the choice (4.2.1) for the projected density operator turns out to be a dead 
end. Indeed, one does not know how to evaluate the entropy (3.3.3) because 
it involves the calculation of {ln{pNgT^'^^ Thus, the solution of 
the finite temperature projected BCS problem will have to rely on other 
methods such as those which are described below. Nevertheless, the results 
obtained in Sections 4.2, 4.3 are sufficient to treat the problem at T = 0. 
Indeed, the determination of the ground state only requires a minimization 
of the projected energy (4.3.4). 

We still have to transform the results of the preceding section and ob- 
tain their limit at zero temperature. According to equation (3.3.15) this is 
obtained by letting all the quantities go to -l-oo in proportion to the 
variable [3 = 1/T. Keeping the dominant terms in all the expressions, we 
find that it is convenient to define the quantity 

= cosLp — sin(^, (4.4.1) 

and for any function f{ip) to introduce the notation 

((/(<p)))l'^l ^ ^ dp f{p). (4.4.2) 

Then, the partition function becomes 

Tr oc e p ((l))[f' . (4.4.3) 

The diverging prefactor exp[— integral part of the lim- 
iting process. However, because it appears in the numerator and in the 
denominator of all the calculated expressions, it disappears from the phys- 
ical quantities. 
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Then, the limit of (4.2.8) gives the zero temperature density matrices 






(T4.4) 

from which one derives the expression of the expectation value of N us- 
ing (3.2.6) 



n, 






2D, 



(d,v) 

0 V 



No = 






1 




(4.4.5) 



As (4.3.3), this is not an equation but an identity valid for any set of coef- 
ficients {Cp^^^^}- 

For the projected energy we use (3.2.9) and obtain 



P 



1 

((!))[,'’ 




1 

4 






[d]_ 



((!))[ 




(d,(p) 

Oq 




(4.4.6) 



4.5 Projected BCS at T = 0, equations 

The variation of (4.4.5) with respect to provides the following set of PI 
equalities 



No 



d 






[d] 







^Op 



[d] 

0 



I sm e 



ILp 



^Op ^Oq 




(4.5.1) 
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On the other hand, the minimization of the energy (4.4.6) gives a set of O 
equations 



d 






[d] 



dd'f 



C 1 



((W2 , 



Gpg Sq 



[d] 



[d] 






S. 






D, 



(d,v) 

Op 



epB^^ + i sin ip 



(1 - G,[^l)e*<^ 



^2 ~ 

i¥^p 



D, 



(d,v) 

Oq 






C [d] q[d] 
Oq Or 



4 ^ T-)id,tp) p.(d,ip) 

q,r^p ^Oq ^Or 



[d] 



(4.5.2) 



4.6 Projected BCS at T = 0, generalized gaps and single particle shifts 

The elimination of i9/9r9|f^(ln ((l))[f^) in equation (4.5.2) yields the varia- 
tional equations which have to be solved in terms of Cp^‘^ and 



0 = ^p'^'(ep - - Gpt^lAfol. (4.6.1) 



These equations are formaly similar to the BCS ones (3.3.9). However, they 
contain a state dependent energy shift /dp^°^ 



-1 





1-G„ 



i sin ip 

T^{d,^p) 

d-'Op ^Oq 



(4.6.2) 



1 

4 



(7 Q M Q[d] 

2_^UqrOq (d.cp) ^(d,<p) ^(d.cp) 

q,r^p ^Op ^Oq ^Or 




One should note that is not a Lagrange parameter as p is in the stan- 
dard BCS formalism. Indeed, there is no constraint on the average particle 
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number to fix the since the T = 0 limit of the density operator 

corresponds to a state with exactly Nq particles. The quantities are 
self consistently calculated from the and 
The gap is also state dependent and equal to 



-1 




[d] 

• (4.6.3) 



The projected BCS solution is therefore formally identical to BCS (3.3.12) 



q [dl ^ [dl £p Mp 

- [iVo] ’ - JNo] 

t-p Cp 



(4.6.4) 



once quantities similar to BCS quasiparticle energies have been intro- 
duced 

^ . (4.6.5) 

One can also write a generalized T = 0 gap equation 



a[-^o] 

P 




A [-^o] 




-1 



^Or 




[d] 



(4.6.6) 

Finally, one notes that the average value of the particle number calculated 
with the unprojected density operator T (z.e. (fV)[‘^l = is 

not equal to Nq. The equality is that given by equation (4.4.5). 



The method described in this section is only one among the several which 
have been devised to study projected BCS (or Hartree-Fock-Bogoliubov) 
equations at T = 0 [18,19,21]. Other methods rely on the Thouless theorem 
and the steepest descent technique. On the other hand, in most calculations, 
additional constraints on {N)^‘^ are introduced in order to stabilize the 
evolution of the numerics. 



In some cases (strong pairing), the integrals over tp can be approximated 
with good accuracy by a gaussian approximation about tp = 0. In other 
limits (weak pairing), it may be sufficient to replace the integrals by sums 
over a limited number (4 to 8) of angles tp. 
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5 Stationary variational principle for thermodynamics 



5.1 General method for constructing stationary principles 



In Section 4.4, we have seen that, in the canonical formalism, the straight- 
forward minimization of the Gibbs Free Energy is facing difficulties linked 
to our unability to evaluate the entropy (3.3.3) for density operators such 
as (4.2.1). In this section, we describe an alternative variational principle 
which does not require the direct calculation of the entropy. It involves the 
search for a saddle point rather than a minimum solution. 

We first sketch a general procedure for constructing variational princi- 
ples which has been analyzed by Balian and Veneroni over a series of arti- 
cles [22,23]. It concerns the following class of problems: given a quantity 
of interest /(x*), function of some variables {x*}, one wants to determine 
the value of / for the solutions {xq} of a set of equations {(/^(x®) = 0}. In 
many interesting cases, solving exactly this problem is impossible because, 
either one cannot solve the equations {^^(x*) = 0}, one cannot calculate the 
value of / for the solution Xg, or both reasons. 

In the method discussed by Balian and Veneroni, an additional variable 
is associated with each equation gj{x^). The action J is then defined as 



J{x\ = fix'') 9j{x')- (5.1.1) 

3 



It is a function of both sets of variables {x*} and Formally, the latter 

play the role of Lagrange parameters. When a full variation of with 
respect to x* and is done, the equations gj = 0 are recovered. Then, 
since the stationary values of the variables x* and wi are such that x* = Xg, 
one has J{x\, wl} = /(xg). 

In general, a full variation is not possible. On the other hand, it can 
sometimes be performed for restricted trial spaces of the x*’s and the tzji’s. 
Then, depending on the adequacy of these trial spaces, the approximate 
stationary solution may be sufficiently close to the exact saddle point of the 
functional J to yield a good approximation of the quantity /(xg). Figure 2 
illustrates the spirit of the method. 
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Fig. 2. Contour lines of the functional (5.1.1). The axes w and uj represent 
the spaces associated with the variables a;* and cu-’, respectively. The surface J 
exhibits a saddle point behaviour. The height of the saddle point is the quantity of 
interest f{xl). The point marked with {x, uj} stands for an approximate solution 
which lies in the vicinity of the saddle point. The value of the action J evaluated 
at {a;,ci7} is equal to the seeked value f{xs) up to second order terms in x — Xs 
and vj — cug. 



5.2 Stationary action 

We now apply this general procedure to the evaluation of the thermody- 
namics properties of a system [24] . 

5.2.1 Characteristic function 

At equilibrium, the density operator of a system is given by the Boltzmann 
Gibbs distribution calculated with the Hamiltonian K (3.2.8) 

£) = exp(-/3iC), (5.2.1) 

where K = H (resp. K = H — iiN) in the canonical (resp. grand canonical) 
formalism. The thermal averages of any set of relevant operators {Q-y} can 
be derived from the characteristic function 

exp(<p(C)) = Tr 1(C)£>, 



(5.2.2) 
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in which appears the generating operator 

i(^) = e . (5.2.3) 



Indeed, when all quantities ^-y vanish, the characteristic function (/j( 0) gives 
the partition function Tr D. Furthermore, the first derivative of (p with 
respect to provides the expectation value of the operator Qy 



dtp(g) 

diy 



|{=o — 



(5.2.4) 



while second derivatives yield correlations and fluctuations of the {Qy} op- 
erators. When feasible, higher derivatives give access to all other cumulants. 



5.2.2 Transposition of the general procedure 

For the problem that we are considering, the variables introduced in 
Section 5.1 are the matrix elements of density operators in an appropri- 
ate subspace of the Fock space. For instance, for the canonical case, this 
subspace is deflned by the projector if Nq is the number of particles. 
The values {a;g} are the matrix elements of the Boltzmann Gibbs opera- 
tor D (5.2.1) in this subspace. 

On account of the discussion in the previous section, we select the char- 
acteristic function (5.2.2) as quantity of interest playing the role of the 
function / in Section 5.1. 

For the constraints gj{x'^} we take the Bloch equation 

-h i \kt>{T) + t>{T)k] = 0 , (5.2.5) 

dr 2 L J 

whose solution over the interval [0, P] determines the matrix elements of 
the Boltzmann Gibbs operator. This equation is solved with the boundary 
condition 

22(0) = i, (5.2.6) 

where 1 is the unity operator in the Fock subspace associated with all the 
exactly preserved observables. For instance, in the canonical case, one has 
1 = Pnq- The number of equations gj = 0 associated with the Bloch equa- 
tion is therefore the number of matrix elements of the density operator T> 
times the “number” of pseudo times values t. Thus, as Lagrange parame- 
ters , we can take the matrix elements of a T-dependent operator A{t) . 
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According to the prescription (5.1.1), the functional J is then 

J{D{t), A{t)} = Tr V{(3)A{0 - drTr A(r) 

xr^ + ^[KT>{T)+T>{r)K] 

where Tr denotes now a trace in the relevant Fock subspace. In the 
variables are T>{t) (0 < t < /3) and A{t) (0 < t < /3). 

The variation of this action with respect to A{t) gives the equation 




Tr 6A{i 



dP(r) 

dr 



1 

2 I 



kt>{T) + t>{T)k 



= 0 . 



(5.2.8) 



As expected from Section 5.1, the most general variation with respect to 
A{t) gives the equation (5.2.5). The variation of the action with respect to 
T>{t) yields 



Tr 6V{t) 




1 ■ 
2 . 



kAk) + A{T)k 



= 0 . 



(5.2.9) 



(Hint: first, transform the action (5.2.7) by means of a part integration 
over T, then vary) while the variation with respect to 27(/3) gives 

Tr<5P(/?)(i(C)-i(/3))=0. (5.2.10) 



5.2.3 General properties 

In this section, we discuss the properties of the formalism when some general 
conditions (satisfied by most trial subspaces) are imposed on the variational 
spaces of V and A. 

We consider trial spaces for T> and A such that 

6A(xA, SVoz'D, (5.2.11) 

are acceptable variations (be. multiplication by constant is allowed) and 
such that 1 belongs to space of operators A. Then, one derives immediately 
the following equation 

e<p(C) = Tr A{^)V{f3) = Tr A(0) , (5.2.12) 

which for ^ = 0 determines the partition function in terms of 2?(/3) 

e7^(0) = Tr P(/3)^^„ . 



(5.2.13) 
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Other properties concern the evaluation of the expectation values of the 
Hamiltonian K, of the number of particle N and of the operators Qj. These 
are a direct consequence of the stationarity of the action within the trial 
subspaces. Indeed, the total derivative of the stationary value of the action 
Tu-l) with respect to any parameter (here the parameters are or 
/J-) is equal to the partial derivative of J with respect to this parameter 
evaluated at the stationary values of the variables {xl,wi}. Using this 
property, one derives the following equalities: 



and 



d^^ l«=o - (^7) 



Tr t>iP)Q^ 
Trt>{f3) 



(5.2.14) 



dy>(0) 

dp 



-^lnTrP(/3)|5=o = 



Tr V{P)K 
Tr V{P) 



l?=o 



(k) = E. (5.2.15) 



We can now define the free energy and the entropy as 

Fg = -^:^(0), (5.2.16) 

and 

S = P{E - Eg) = yr{0) - (5.2.17) 

Thus, in this method, the entropy is a secondary quantity obtained only after 
the calculation is completed. In addition, it does not have to be related to 
T){P) by equation (3.3.3). Nevertheless, our definition guarantees that for 
any trial space compatible with the above conditions, the entropy (5.2.17) 
satisfies the thermodynamic consistency relation 

S = -^- (5.2.18) 



6 Variational principle applied to extended BCS 

Here the adjective “extended” indicates that we are concerned by more than 
the partition function since the equations are designed for an evaluation 
of the characteristic function tp(^) for any value of the parameters {^}. 
Thus, in addition to the partition function, the action (5.2.7) gives access 
to expectation values, correlations and fluctuations. 
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6.1 Variational spaces and group properties 

The choice of the variational spaces for T> and A is inspired from Section 3.3 



i(r) = I 

P(r) = fM(r) ^ 0p'^'}- 



( 6 . 1 . 1 ) 



We now introduce notations which take into account the group property of 
the T’l'^l-like operators: 

rj^[ad] rj^[da] 

The group is non commutative since in general 
therefore 

7 - [ad] /-7~ [a] /T- [d] /-r- [da] rr [d] /-j- [a] 

p — dp Ip , Ip — Ip Ip 

In order to manipulate the algebra more conveniently, it is preferable to 
work with the following expression of 

= coshZpt'^^ — sinhlpt‘^^u|fl(T, (6.1.4) 

which is equivalent to equation (3.2.2). It involves a vector of length unity 
determined by 9p^'^\ In particular, this form of takes naturally 
into account an extension to complex values for the quantities Sp^‘^ such as 
mentioned in (3.1.7). Using the property of Pauli matrices (ox, <Jy, <Jz) 



( 6 . 1 . 2 ) 
We have 
(6.1.3) 



(6.(j)(c.(t) = (6.c) + lib X c).(?, (6.1.5) 



which is valid for any pair of vectors b and c as well as some of its conse- 
quences such as 



det(y -I- c.ct) = 7 ^ — c^, ( 7 -I- c.ct) ^ = 



72 _ ^2 



(7-c.ct), (6.1.6) 



(here 7 is a c-number) one obtains an explicit expression of Tp^°'‘^ 
Tp = cosh Ip cosh Ip -I- sinh Ip sinh Ip .v}p^ 
cosh sinh lp^°‘^v}p^ + sinh cosh 
—i sinh sinh (m|,“^ x ti|f^) 



(6.1.7) 



The matrix Tp^°'‘^ being the product of two unitary matrices is also unitary. 
Thus, when the bracket is written as sinh the first term is equal to 
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cosh?p^“‘^^ and the form (6.1.4) is retrieved. The noncommutativity property 
q-^[ad\ ^ manifests itself in the last term of the bracket. 

According to equation (3.2.5), the associated density matrices are 

and the required evaluations of traces are obtained from equation (3.2.3) 

Z = Tv AV = e~^o'“' “ Y[ det (l + . (6.1.9) 



6.2 Extended BCS functional 

We now have to specify the generating operator A{^) (5.2.3). We assume 
that it also belongs to space of operators as all the A(r)’s. This means 
that the operators Qj are one-body operators: 

= (^l^p + 4®p) > (6.2.1) 

P 

where is a collection of El c-numbers. From equation (6.1.9) one 

calculates = Tr A(^)'D by means of an appropriate introduction of 
quantities such as = 0)and Tp'^^\ 

The first term in the integral of the action is obtained by deriving 
and with respect to r in the expression (6.1.9). After some work, one 
gets 



Tr = Z 
dr 




+ ^tr 

P 




( 6 . 2 . 2 ) 



(Hint: use the following properties (I) Z depends on only through 
the multiplicative factor exp(— (H) det A = exptr InA for any ma- 
trix A; (III) if S stands for any kind of variation then (5tr In A = tr A“^i5A; 
(IV) = Tp['^l■^7^p[^“^Tp[^l). 

As the Wick theorem is valid for any operator of the type, from 
equation (3.2.9) one obtains the pseudo energy 



-Tr A 



KV + VK 



= -Tr K 
2 



VA + AV 



{ P J_ I 

j-rC-'[/v jj’ 

(6.2.3) 
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and the extended BCS functional X (5.2.7) 



J{P, A} = - f drZ 

Jo 



dl^ 

dr 



(6.2.4) 

By means of a part integration with respect to the variable r one obtains 
the equivalent expression 

r/3 



J{V, A} = - Z{f3) + Z(0) + f drZ 

Jo 



din 



+ ^tr Tp[“l 



dr 



(6.2.5) 

which is more convenient when variations with respect to Tp^‘^ have to be 
performed. 

6.3 Extended BCS equations 

Let us use equation (6.2.4) and express the stationary conditions of with 
respect to A{t). The variation with respect (t”) is easily performed since 
this quantity only appears in Z{t). 



din 



^trTpI^l + i + • (6.3.1) 



dr 



To vary with respect to one can use the properties 5£{R\d^ = 

Y, tr 71:{7^|f'}(57^p["^l and ,57^ = (1 - 7^)(5r(l - 7^) to obtain 



dTpI^l 

dr 



2 L 






(6.3.2) 



The corresponding equations deduced from a variation of the action with 
respect to 2 ?(t) are obtained in similar way from the form (6.2.5) of Cf . 
They are 



dr 



= ^ tr Tp 



-1- MdrM 1 

p -Vp 



+ , (6.3.3) 



and 



}Tp [“1 +Tp[“l7f{7^|f“l} 



dr 



2 L 



(6.3.4) 
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It remains to specify the form of the boundary conditions associated with 
equations (5.2.6) and (5.2.10). For the extended BCS problem as it is con- 
sidered here, they are 

Tp[‘^l(0) = l, (6.3.5) 

and 

, Tp [“!(/?) = Tpl^l. (6.3.6) 



6.4 Properties of the extended BCS equations 

The equations (6.3.2) and (6.3.4) can be solved first independently from 
equations (6.3.1) and (6.3.3). Then, and are obtained by means of 
an integration as 






dr' 






(6.4.1) 



and 

^o'“'(r) = + ^o'"'(/3) - ^o'"'(r). (6.4.2) 

From an analysis of the equations one checks the following two conservation 
laws 

^ = 0, ^ = 0, (6.4.3) 

which can be used to constrain the numerical algorithms. Combining equa- 
tions (6.3.2) and (6.3.4) one finds that the derivatives of and 

can be expressed in a commutator form: 



d7),[“‘^l 

dr 




(6.4.4) 



and a similar equation (with a minus sign in front of the commutator) 
for Tp[^“l. 

For the BCS case considered here, alternative formulations are possible. 
Indeed, from the expression (3.2.10) of H{TZ^p^}, this matrix can be written 
in terms of an appropriately defined vector as 






(6.4.5) 



Then, by means of the representation (6.1.4) of the matrices 7),^“^^ and Tp^°‘\ 
equations (6.3.2) and (6.3.4) can be expressed as a set of 2Ct coupled ordinary 
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differential equations plus torque equations acting on the normed vectors 
and u^p \ For instance equation (6.3.2) becomes 



din 



dr 



drt. 



[d] 






= - ( coth I, 



'“1 + .wlfl 



[d] 



uy X 



dr 2 

while equation (6.4.4) becomes 






1/ [ad] 

dr 



du 



i X 4^1, 

(6.4.6) 



ad] 

^ X u 



(6.4.7) 



dr 



;7M y f.M] 

p ^ '-i'p 



6.5 Recovering the BCS solution 

The BCS solution is obtained from the above equations when the problem 
is reduced to an evaluation of the partition function. Thus, all the param- 
eters are taken equal to zero and 

i(0) = i. (6.5.1) 

Then, one checks that if one introduces the operator 

w = ho + Y. (4«p) } f ""t V 

p \^pj 



where the matrix has been defined by equation (3.3.16) and ho 

is a c-number to be determined below, the operators 



V{t) = e“^^ , (6.5.3) 

and 

A(r) , (6.5.4) 

are solutions of the equations of Section 6.3 and are compatible with the 
boundary conditions. 

Indeed, from equations (6.5.3) and (6.5.4) one has = 

exp(— /31T) and thus 






(6.5.5) 
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As a consequence and defined according to equation (3.2.5) 

are r-independent and equal to which satisfies the selfconsistent 

equation (3.3.17). Thus one has 

(6.5.6) 

so that the matrix = exp(— tH{ 7^|P^®^}) is a solution of 

equation (6.3.2). 

According to equation (6.4.1) the value of the c-number /iq is determined 
by the variational equations (in contrast with the situation in Section 3.3 
for the BCS theory) and is given by 

ho = - ^tr (6.5.7) 

p 

From the general equation (5.2.4), the expectation value of any operator of 
the type (6.2.1) is given by 



Tr QjV{/3) 
Tr V{P) 






(6.5.8) 



where Cp^'^^ is given by equation (3.3.12) This expression is the same as 
given by standard BCS formalism. 



6.6 Beyond the BCS solution 



The first difference between the predictions of the extended and standard 
BCS formalisms appears when correlations and fluctuations are evaluated. 
Two situations are possible: the operators Qj commute with the 

Hamiltonian or they do not commute. 

In the first case, it is possible to reformulate the evaluation of exp(:p(^)) 
as the calculation of the partition function exp((p(0)) associated with the 
shifted Hamiltonian 

K = K + Y,^Q^ ■ ( 6 . 6 . 1 ) 

The quantities play thus the role of h for the operator N. One 

can then use directly the results of the previous section For instance, the 
fluctuation AN"^ is given by the formula 



AN^ 



1 av 






( 6 . 6 . 2 ) 



Thus, the grand canonical approximate fluctuation is proportional to the 
derivative of the approximate expectation value just as it is required by the 
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exact thermodynamical relation. The result (6.6.2) differs from the value 
{{N — (iV)PC®l) 2 )[BCS] would be obtained from a naive usage of the 

density operator V{P) = For instance, at T = 0, this formula gives 

a non zero fluctuation, while according to equation (6.6.2) AfV^ vanishes in 
agreement with thermodynamics. 

When the operators Qj do not commute with K, the evaluation of the 
fluctuation is more complicated. I refer the interested reader to 
Sections 4. 2-4. 4 of [24] for a proof. I will only state the result. With the 
notations of the Liouville space, the fluctuation of an operator Q is given by 



AQ2 




Kcoth -/3K 



F-i : Q, 



(6.6.3) 



where K is the RPA matrix, F the stability matrix and Q the matrix asso- 
ciated with the operator Q. 

When Q commutes with K, it corresponds to a zero energy mode of the 
RPA: Q : K = 0. Then, the parenthesis in equation (6.6.3) reduces to 2//3 
and one retrieves the usual expression 

AQ2 = ig : f-i : Q . (6.6.4) 

Therefore, the mean field formalism of generalized BCS takes also into ac- 
count the two body correlations at the level of the RPA. 

The evaluation of the fluctuations by means of equations (6.6.3) 
and (6.6.4) can be difficult since it involves working in the Liouville space 
whose dimension is In practice, it may be easier to return to 

equations (6.3.4) and (6.4.4), to solve them for several (at least three) values 
of to evaluate the function tp(^) and to compute the fluctuation by means 
of a numerical differentiation. 



7 Particle number projection at finite temperature 



7.1 Particle number projected action 



The generalized BCS method discussed in Section 5 can be extended to the 
canonical framework, by the following choice of trial spaces for T> and A: 



i(r) = fH(^) ^ 

T){t) = P^„f[^l(r)Pw„ ^ {Zo''"', l/\ • 



(7.1.1) 



It involves the same number of variational parameters as (6.1.1) With no- 
tations similar to those of Section 4.2 the explicit form of T>{t) is 



P(r) 




dp dp' f f ['^l(r) f 



(7.1.2) 
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We again introduce the more compact notations 

^Vp’\ ^ ,d,v'] = (7 1 3 ) 

which express the group properties. In equation (7.1.3), 7),^“^ and are 
defined as in equation (6.1.4) and is given by equation (4.1.5). 

Then, a straightforward generalization of equation (6.1.9) yields 

Z = TrAV=-^[ [ d(^d(^'e“*(‘^ + ‘^')^o ^[“’‘^>‘^’‘^'1, (7.1.4) 

47t^ Jq Jq 

with 

^[a,ip,d,ip'] _ gt((/5 + ip')^ — ^ 0 ^“' — (Jgl; _|_ ^ (7.1.5) 

P 



In order to write the action explicitly, we notice that it would have been 
equivalent to take A = PnoT^°'^ Pnq and V = T^‘^. Therefore, from a direct 
transposition of (6.2.4) we obtain 



J{T), A} = Z^~ 



-I ^277 p'Z'K ^ pj3 

j j dpdip'e~P‘fi + ‘^^^o j 



dl^ 

dr 






dr 






(7.1.6) 

In this equation, Z^ is obtained by replacing T[“1 by A and r = /? in 
equations (7.1.4) and (7.1.5). In addition, we have used the property 



£;{7^[v'.a.V.d]} _ £|7^[a.V.d.v']| 
which results from the commutation of K and N . 



(7.1.7) 



7.2 Number projected stationary equations: sketch of the method 

The variation of the action (7.1.6) with respect to is easily performed 
since this quantity only appears as a multiplicative factor exp(— ?o^“^) in 
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Z[o->v,<i,v One obtains the equation 







1 1 
Z 47r^ 




dip dip' ^0 Z\-°-’V,d,v'\ 



^trTpI‘^1 

- P 








(7.2.1) 

It is a generalization of equation (6.3.1) and, as this equation, it implies that 
is obtained by an integration once Tp^^'^r) and Tp^'^^r) are known. 

The variation of with respect to 7),^“^ gives a matrix equation of the 
type (6.3.2) or (6.4.6). It is however much more complicated since it involves 
angle averaged operators of the H type. For a detailed derivation we refer 
the reader to the Sections 5.2 and 5.3 of [24]. An analytic solution of these 
equations appears difficult and a numerical solution is not yet available. 



8 Number parity projected BCS at finite temperature 

In the introduction, we emphasized that in finite systems, many signatures 
of pairing correlations were detected in and measured from the staggering 
of properties with respect to the parity of the particle number. For binding 
energies, a critical evaluation of this point, will be done in the Section 9.2. 
At any rate, it appears that an interesting improvement over BCS of the 
physics of pairing can already be obtained by enforcing exactly the correct 
number parity while keeping a grand canonical formalism to constrain the 
the average particle number to have the correct value Nq. 

8.1 Projection and action 

The projection which extracts the correct number parity component out of 
a state is 

4 = i(i + ,7e-^), (8.1.1) 

where rj is equal to +1 for even systems and —1 for odd ones. 

In this section, we specialize the action (5.2.7) for the calculation of the 
thermodynamic function and the expectation value of one-body observables. 
Then, according to the discussion in Section 5.2, the generating operator is 
equal to the unit operator: A = 1 . 

The other boundary condition concerns the operator 'Dp for r = 0. It 
must be equal to the unity operator in the Fock subspace with the correct 
number parity. This implies that Dp(0) = Pp. 
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Because the projection (8.1.1) commutes with operators of the type, 
one can use the action (5.2.7) with an Ansatz very similar to the choice made 
in Section 6.5. As was done in this section, we introduce an operator 



Wr, = + E 4"' H^p) ] (8-1-2) 

p \®p/ 

where (implicit in and are the quantities which must 

be determined from the stationarity condition of the action. We look for 
solutions T>p (r) and Ap (r) of the form 

f>p(r) = Fpe~'^^p , Ap(t) = e~(^ ~ . (8.1.3) 



One notes that, because the projection commutes with Wp and satisfies 
Pp = Pp, it is not necessary to have a projection on both sides of exp(— rlF,;) 
as in equation (7.1.1). The operators VpAp and ApVp are equal and do not 
depend on r 



Vp{r)A{T) = ApiT)VpiT) = , 


(8.1.4) 


where T^p^ is defined as 


X^P] = Q~PWp 

To calculate the action J7, one remarks that 


(8.1.5) 


.. . 


^ = -WpT^P > 


(8.1.6) 



so that the integrand in definition (5.2.7) is constant. The integral is per- 
formed trivially and the action can be expressed as 



Jp = TrPpf^P^ 



1 + P{Wp 




(8.1.7) 



The density operator PpT^P^ has two components for the values 0 and tt of 
the gauge angle in the operator exp(iipN) introduced in Section 4.1. With 
the conventions of this section, we denote these two components and 

'plv,Tr] respectively. They are defined in terms of by 

rp[p,0] ^ rpip] ^ y[p,7r] ^ ^inNrplp] ^ (8.1.8) 

From equation (3.2.3, 3.3.4) and (3.3.5), for (p = 0, one obtains an explicit 
formula for the trace 

^[p.o] ^ f[p,o] ^ 

P \ 




(8.1.9) 
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and for the density matrix 
1 



TZ, 



= - (l - , tpM=tanh/?: 



M 



(8.1.10) 



The definition (3.2.9) provides the expression of and gives 






1 + PiWr, - K) 



_^[»7.0] 






( 8 . 1 . 11 ) 



In order to evaluate the second part the action, one notes that according 
to equation (4.1.5) the matrix is equal to —1, Then, one obtains the 
following equalities 

(8.1.12) 



7- [vM — _'T h.O] 



and 



h,o] 

7^ 



L-p 



27^p[’'’°] 






Using equations (3.2.3) and (8.1.12) we calculate the second trace 

2 

— C7[p,0] 



^[p.TT] ^ ^ g-/3/io'’'' q ^2sinh/3^^ = 



= (8.1.14) 



where the c-number is the product of all the thermal factors 






' p — j_ j_ i-p 

p 



[V]2 



(8.1.15) 



The calculation of the action can now be completed. One obtains 



l7p — Zrj 



1 + /3Uo'"'-E 



y- [q] 4- nr 

M - 



(8.1.16) 



1 + ryrp 

where we have introduced the following number parity averaged quantities 
l + r^rp - g{7^[^^°l} + 7?rpg{7^[^’"l} 






y — 

C.p — 



1 + ryrp 



(8.1.17) 



8.2 Variational equations 

The discussion of the variational principle in Section 5.2 and in particular 
equation (5.2.16) has established that the opposite of the logarithm of the 
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stationary value of the action f7^st, for a given set of trial spaces, should 
be equal to the optimal value of the product of the Gibbs Free energy Fq 
by the temperature. is determined by the stationarity condition of the 
action (8.1.16) with respect to the parameters which appear in 



a) Stationarity with respect to 

As appears explictly in the bracket and implicitly in through a 

multiplicative exponential factor, the variation is easy to perform. It yields 
the equation 






+ yrrft^p^ ^ 
I + r]rp 



-'T)’ 



( 8 . 2 . 1 ) 



Reporting this equation into equation (8.1.16) one ontains the expected 
result: 

Jrist = Zp. ( 8 . 2 . 2 ) 



b) Stationarity with respect to i9p[ri] 

This angle only appears in According to equation (3.2.11) one has to 

introduce the two quantities 



Ah.o] 

p 



c H+w aF: 

2 '~'pq>^q '-g 7 ^p 



1 (7 Q lv]f[v]-i 

2 '-'pq'^q '-g 



(8.2.3) 



Then, one checks that the following definition of a number parity projected 

gap 



p — 



1 + r]rptp^~^ 



(8.2.4) 



allows one to write the stationarity equation in a form similar to 
equations (3.3.9) and (4.6.1): 



0 = (cp - - A^pCpt’'!. 

A formal solution of equation (8.2.5) is 

c [»7] _ ^vp r< — ^p~ ^ 

— ~ 5 — 

p ^7] p 

where we have introduced the BCS-like energies 



(8.2.5) 



( 8 . 2 . 6 ) 



^VP 



= V («p “ Ai)^ + A. 



2 

VP ■ 



(8.2.7) 




268 



Atomic Clusters and Nanoparticles 



c) Stationarity with respect to 

fr?l 

This variation is more toilsome because Cp is present in almost all the terms 
in the action. Nevertheless, it is technically straightforward and it leads to 
equation analog to equations (3.3.14)and (3.3.15): 



2rj Trf fcp 
- ’qrrjt'^p 









1 - rir^t 



( 8 . 2 . 8 ) 



where the c-number is given by 



Jv] 

in in 



^i + ryPpLi Jz \ / 

(8.2.9) 

One notes that no equation equivalent to (3.3.14) and (8.2.8) appears in the 
zero temperature formalisms of Sections 4.5 and 4.6. 

The set of equations is now complete. As with BCS, the numerical 
solution of the problem is more easily performed when equations (8.2.5) 
and (8.2.9) have been given another more standard form via the elimina- 
tion of and From the definitions (8.2.3) and (8.2.4) and equa- 

tion (8.2.6), we obtain 



A _ 1 flv] 1 + ?? tp^ 



( 8 . 2 . 10 ) 



which is the analog of the BCS gap equation (3.3.3) from which it differs 
only through the last ratio in the R.H.S.. Another equation 



A _ A [p.o] _ 1 ^yg 

Zu q 



( 8 . 2 . 11 ) 



has no BCS counterpart. Finally, it is convenient to rewrite the quan- 
tity kp as 



^'■7 - F 



(M-- _ 1 + - 4"'’’') 

2{l + r]rp) I _ 



(8.2.12) 



X 1+ ^'/'P Y" ^ 

V 1 + ^^P p l-ryrpt'"’”" 



and to express the quasi-particle energies as 



eN - g _ 



App(Aj7’-l - Aj7’°l) 

+VPytp^~^') epj 



(8.2.13) 
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\r}] 

In this expression of Cp , the second term in the bracket can be evaluated 
by means of equation (8.2.11). 

One notes that this set of equations reduces to standard BCS when one 
sets rj = 0. In the gap equation (8.2.10) the modification introduced by 
the number parity projection concerns only the last ratio in the R.H.S.. Its 
dependence on ^ shows that contributions to the sum only come from 
terms such that is small. As expected, only single particle states near 
the Fermi surface play a significant role. The dependence on 77 is such that 
the fraction is larger (resp. smaller) than one for rj = + (resp. rj = —). The 
gap is thus increased (resp. decreased) with respect to BCS. 

8.3 Average values and thermodynamic potentials 

Using the expressions for and S'p^’*^ resulting from the solution of 

equation (8.2.6), one can write expectation values of all operators. For 
the energy, equation (8.1.7) yields 






(cp - p)~ 



(ep-7i)^ + iApp^tpM +?7rpt|7' ^ 



^r) p 



l + prp 



while the particle number average value is 



(8.3.1) 



(iV)!"! = 



Tr VpN 
Tr f>p 






E 



€p - p + r] Tp tp^ ^ 

Cpp l + rjrp 



(8.3.2) 

In the latter equation, the second equality results from equation (3.2.6) used 
with the parity projected density matrices 



77-p p — 



77., 



h.o] 



+ r7rp77p[’'’^l 



1 + ryrp 



(8.3.3) 



Since, although (/3) has the correct number parity, one is still working 
in the grand canonical formalism, the equation (8.3.2) must be used to 
determine p through the constraint 

(7V)[’'l = No . (8.3.4) 



From equations (5.2.13, 5.2.16) and (8.2.2) one obtains 
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which according to equations (8.1.9, 8.1.17, 8.2.1, 8.2.7) and (8.2.7) satisfies 
the thermodynamics relation 



dF^ 

dfi 



(iV)M = -No. 



(8.3.6) 



Finally, equation (5.2.17) can be used to get an explicit form for the mean- 
field parity number projected entropy 







l + Vr-n 



(8.3.7) 



to be compared with the BCS expression (3.3.6). In contrast with the 
derivation of BCS equations in Section 3.3 which were based on the prior 
evaluation of an expression of the entropy associated with the operator 
here, the entropy is only obtained at the end of the calculation as pointed 
out at the end of Section 5.2. 

Similar, although not quite equivalent, equations have been given and 
discussed in [25-30]. 



8.4 Small temperatures 

Since, up to now, small superconducting systems are experimentally in- 
vestigated mostly at zero or small (compared to the gap) temperatures, it 
is worthwhile to analyze the large [3 behavior of the equations derived in 
Section 8.4 



8.4.1 Even number systems 

For even systems, it turns out that first order terms in the expansions vanish. 
These should therefore be carried out to second order in order to extract 
the dominant temperature behavior. From the expansion of the thermal 
factors 
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one deduces that 

lnr+ -4^ e-M^' + o , (8.4.2) 

p ^ 

where is the smallest of the quasiparticle energies 

The extreme low (see meaning of this adjective in Section 3.4) temper- 
ature form of the projected gap equation (8.2.10) is 



^+p — 9 ^ ^pg 



A 



+ q 
e+p 



1 _ - 4 ^ 

r^p,q 



(8.4.3) 

For T = 0, one recovers the BCS equation which yields the solution (2.3.6) 
which is obviously a superposition of states with an even number of parti- 
cles. On the other hand, in contrast with equation (3.4.1) the temperature 
behavior is governed by terms involving the sum of two quasiparticle ener- 
gies. These correspond to the simplest excitations in an even system (see 
discussion of Fig. 1). As for BCS, an increase of the temperature induces a 
decrease of the gaps. 

At small T, the entropy (8.3.7) 



^[+l = 2/3^eW 

P 




-f2^e-^(' 

q¥=p 



,[+] 




(8.4.4) 



also has a correct temperature dependence in contrast with the BCS 
expression (3.4.2). 

The condensed matter low temperature limit (defined by the inequali- 
ties (3.4.4)) of the even parity number gap is 



A+(/3)=A(^l-4^^e-2/3^), (8.4.5) 

where A is the T = 0 BCS gap defined by equation (3.4.3). 

In the same limit, the even number parity entropy is 

SM = 87T(wA)2e“2/3A^ (8 4 g) 



which can be compared to equation (3.4.7). 



8.4.2 Odd number systems 

When rj = —, the analysis of the last ratio in the R.H.S of equation (8.2.10) 
requires some care. Indeed, since it takes the undetermined form 0/0, one 




272 



Atomic Clusters and Nanoparticles 



must first extract from the numerator and the denominator the dominant 
terms which behave as exp(— /3eg where Cq ^ is the lowest quasi particle 
energy among the e|, ^’s. 

Then the extreme low temperature limit of the equation (8.2.10) for 
p yf 0 is 






^pq 



q^O 



G. 



pO 



A-o „ A_5 

g-o 



,-/3(eg[ ' - e[) 0 



(8.4.7) 

At T = 0, one does not recover the usual BCS gap equation. Instead, one 
obtains a “blocked” equation in which the single particle levels 0 and 0 are 
eliminated from the sum. Thus, our formalism recovers automatically the 
variational equation which determines the coefficients Up and Vp (p yf 0) of 
the state 



I'I'fBlocked BCS]) — («p + |0) (8.4.8) 

p#0 



which replaces l^rpcsj) given by equation (2.3.6) for a BCS description of 
odd systems. The gap equation (8.2.10) not only contains a blocked equation 
as a limit, but it also selects the single particle state which must be blocked. 
The blocking of a state near the Fermi surface leads to a reduction of the 
gaps A_p at T = 0 as compared to A+p = A. 

In equation (8.4.7), the temperature dependence is governed by the quasi 
particle energy differences Cp ^ — Cq ^ In a perturbative scheme, this cor- 
responds to a promotion of the unpaired particle from the state 0 to the 
state p. 

In the same low T limit the entropy (8.3.7) becomes 



= ln2 + (eH - e'"') e"^(4 ’ " 4 , (8.4.9) 

p 



where the term In 2 reflects the degeneracy of the ground state. 

In order to extract the condensed matter limit expressions of the gap 
equation and of the entropy, one must start from the original 
equations (8.2.10) and (8.3.7). Indeed, the condition (3.4.4) which insures 
that the temperature is smaller than Cp^^ -I- does not imply that it is less 
than e|r^— e[)~^ as it has been assumed to obtain equations (8.4.7) and (8.4.9). 
Using the formula (3.4.5). the quantity r_ can be approximated as 





In r_ = —4 w 



(8.4.10) 
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Then, after a somewhat lengthy calculation, one obtains 



^ 2w^4/3u>A' 



(8.4.11) 



One checks that at T = 0, the gap A_ is smaller than A. This is a conse- 
quence of the blocking effect which eliminates one level at the Fermi surface. 
As a matter of fact, A_(0) is solution of the equation 



2 

G 





w de 

^Je^ + A^iof ’ 



(8.4.12) 



which only differs from equation (3.4.3) by the suppression of an interval of 
width 1/w at e = 0. 

An analysis of the entropy performed along the same lines gives 



S't 1 = In 2 -I- - -I- - In 



27 t w'^A 



(8.4.13) 



Apart from the contribution In 2 associated with the ground state degener- 
acy, one finds an expression like the Sackur Tetrode entropy for a classical 
gas of one particle moving on a line. Thus, the limit at low temperature 
of the heat capacity C = TdS'[“l/d'C is 1/2. Of course, this result is only 
valid as long as the condition (3.4.4) is satisfied. At temperatures such that 
wT <C 1, the correct formula for the entropy is (8.4.9). 



8.5 Numerical illustration 

A solution of equations (8.2.4, 8.2.7, 8.2.10-8.2.13) can be easily performed 
by standard iteration methods for any type of single particle spectrum. To 
pursue the analysis, we consider a simple model such that the energies Cp 
the twofold degenerate levels are regularly distributed with a density w. For 
the numerical application presented here, we have adjusted the interaction 
strength so that the quantity wA is equal to 10. The active space is defined 
by a cutoff energy A such that A/A = 20 (be. 201 p,p levels belong to 
the active space). This is realized for a pairing strength Gpq = G(1 — 6pq) 
with wG = 0.34. This corresponds to a situation intermediate between 
nuclei {wA « 4) and superconducting metallic islands wA > 50. When 
temperatures are measured in units of the BCS gap at T = 0, the BCS 
critical temperature Tc corresponds to the value 0.56. 
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Fig. 3. Comparison of BCS and number-parity projected gaps at the Fermi surface 
versus temperature for wA = 10 and wA — 200. See text. 

The upper part of Figure 3 shows the gap for an even system with two 
hundred particles. Thus, half of the levels in the pairing active space are 
filled in the absence of pairing interaction. According to equation (8.2.10) 
the values of the gap depend on p. On the other hand, direct computation 
shows that the variation versus p never exceeds a few % and can therefore 
be neglected as will be done in the following analysis. We plot our results 
for the level 0 corresponding to the smallest quasi particle energy. The even 
number parity projected gap is larger than the BCS value as is expected 
from the discussion at the end of Section 8.2. However, the effect is very 
small. 

The lower part of the same figure shows the results obtained when rj = —. 
At r = 0, due to blocking, the gap A_ is less than A. Then, as the 
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temperature increases, so does the gap A_(/3) until the curve merges with 
that of the BCS approximation at a temperature T/A ~ 0.35. 




Fig. 4. Specific heat of an odd-number system for wA = 10. See text. 

Figure 4 illustrates our discussion of the entropy in odd systems of 
Section 8.4. It displays the evolution of the specific heat Cy calculated 
as T(dS'[“l/dT) versus temperature for the same system. From the tran- 
sition temperature downward, the odd number projected specific heat first 
becomes smaller than the BCS value. On the other hand, the inset shows 
that at small T, it crosses the BCS curves and approaches the limit 1/2 as 
predicted by equation (8.4.13). As compared with the specific heat of a non 
interacting fermion system {S = (27r^wT)/3) (“non int.” curve) the BCS 
and odd number parity projected curves exhibit the well known discontinu- 
ity at the transition temperature. We remind the reader, that, according 
to equation (8.4.9), at T = 0, the odd number parity entropy reaches the 
value ln2 associated with the degeneracy of the ground state and that Cy 
vanishes. 

9 Odd-even effects 

9.1 Number parity projected free energy differences 

Up to now we have been working in the grand canonical formalism and have 
been concerned mostly with the Gibbs free energies Fq . For systems with 
a definite number of particles Nq, one must instead consider the free energy 
which is related to by a Legendre transform 



fM(Ao) = Fg'’'1(m)+M^o, 



(9.1.1) 
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where the chemical potential ^ is adjusted so that equation (8.3.6) is satis- 
fied. 






8 



1/w 

_Mi 

l/w 

" 



0 



No=2n No=2n+l 

Fig. 5. Evolution of the Fermi level /i from an even {No = 2n) to an odd {No = 
2n -I- 1) value of for 77 = -f, — or 0. 

Although only have a physical meaning for even systems and 

F^~\ for odd systems, they can as well be evaluated for any (even 

non integer) value of the expectation value (TV)!**] by means of a continu- 
ous modification of /i. We have done it for the physical system analyzed 
in Section 8.5. For both 77 = ± (and 77 = 0 in the BCS case) we have let 
(]V)[r 7 ] yary from 2n = 200 to 201. In the three cases, we find, as intuition 
suggests, that the chemical potential fj, increases continuously in the inter- 
val [— l/2w, 0] as sketched in Figure 5. An accurate approximation of the 
relation between /i and the average number of particle (N) [’^1 is provided by 
the relation 

^{2n + x) = (1 — a;). (9.1.2) 

2w 

At finite temperature, the relevant difference between odd and even neigh- 
bour systems is 

5F = F'^-\2n+l)-F^+\2n), (9.1.3) 

with the free energies defined by equation (9.1.1). For the system 
considered in Section 8.5, it can be computed directly from the numerical 
solutions of the equations derived in Section 8.2. The result is plotted as 
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the solid curve in Figure 6 . Two regimes are clearly visible. At small 
temperatures, the difference (9.1.3) decreases almost linearly from a value 
very close to A at T = 0, until it vanishes at a crossover temperature 
close to 0.35A. Above this temperature the difference remains constant 
and equal to a small negative value ~ — 0.05A. Although superconductivity 
still exists up to the transition temperature Tc = 0.56A (see Fig. 4), SF does 
not exhibit any modification when the pairing disappears. In other words, 
above the crossover temperature and below Tc, the odd-even difference of 
the free energies does not reflect the still existing pairing correlations. 




Fig. 6. Difference between the free energies of neighbouring odd {No = 201) and 
even {No — 200) systems as a function of temperature for wA = 10 and wA = 200. 

This phenomenon has been mapped out carefully in the experiments 
reported in [9] . 

We will now attempt a semiquantitave analysis of this result. Based on 
the discussion at the beginning of this section, we consider a path which 
connects the systems Ng = 2n and Nq = 2n -I- 1. The difference SF is 
written as a sum of two contributions 

SF = SiF + S 2 F 

SiF = tH(2u +1)- T[+l( 2 n -b 1) , S 2 F = F^+\2n + 1) - F^+\2n) . 

(9.1.4) 

Since for both values of 77 the chemical potential fj,{{N)^'^^ = 2n+l) vanishes, 
one obtains 

SiF = FG^~\^J. = 0)-Tg^+'(m = 0) = -ilni|^ ~ -4 In | > (9.1.5) 

p p 1 -b r+ 

by means of equations (8.3.5) and (8.1.7) and with the assumption that 
lnZ[+’*^l ~ lnZ[“’°l. As a matter of fact, from the numerical solution one 
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can check the validity of this assumption. For small temperatures, it is 
justified to use the approximation (8.4.10) of lnr_. We then obtain (in [24], 
we assumed A_ = A instead of equation (8.4.11)) 



1 T 

(jiF’cs A In 

2 w 2 



/ 8{ttwA)‘^ \ 

V /3A j' 



(9.1.6) 



The contribution 62 F can be evaluated from the thermodynamic relation 
valid for the free energy 



aF’M _ 



(9.1.7) 



Using the approximation (9.1.2), we calculate 



‘ 2 n -\- 1 



S2F = 



M(Ao)dAo 



1 

4w 



(9.1.8) 



In Figure 6, the dotted curve corresponds to 62 F (it is labelled BCS since the 
value —1/Aw is the result obtained for the BCS theory at any temperature) 
while the dash-dot curve is 5\F . The quality of the agreement vindicates 
our analysis, although it relies on the definition of a path such that, except 
for the end points, there exists no corresponding physical system. 

Below the crossover temperature, pairing effects are detectable in the 
difference 5F . Above this temperature, there only remain effects associ- 
ated with the discreteness of the spectrum measured by level density w. 
The range of temperature over which pairing correlations affect 5F is de- 
termined by the slope of the curve 61 F. This slope increases as the size of 
the system grows {w decreases). For sufficiently large systems, the temper- 
ature range over which odd-even effects are visible will be out of reach for 
present experimental cooling techniques. In such a situation, number-parity 
projected BCS and standard BCS make identical predictions. 



9.2 Nuclear odd-even energy differences 

Although the last section was concerned with the temperature dependence 
of the odd-even free energy difference. Figure 6 shows that, at T = 0, i5U 
which becomes the difference between the binding energies S{F[), is equal to 
A — 1 /2w. This quantity involves contributions from both pairing and single 
particle spectrum origin. In this section, we report on a detailed analysis of 
this point in the context of nuclear physics. 

Until recently, within the nuclear physics community there existed a con- 
sensus as to how the observed odd-even staggering of the binding energies 
should be interpreted and used to extract information on pairing proper- 
ties [3,31]. This state of affair was somewhat disturbed by a work which. 
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among several other points, analyzed the well established odd-even stag- 
gering of clusters binding energies [32] and explained it solely in terms of 
shape deformation [33]. Its authors asserted that connecting the observed 
staggering to pairing correlations was not justified for clusters and therefore 
probably also not for nuclei. Since the reasons to believe in the importance 
of pairing concepts for the dynamics of nuclei extends well beyond the odd- 
even mass staggering effects (see Introduction), the authors of [34,35] took 
up the challenge and decided to review the analysis of nuclear data which 
had been performed in the old days. It is their work that this section sum- 
marizes. 

Rather than the simple difference discussed in Section 9.1, for systems 
at zero temperature it is common to measure the amplitude of the odd- 
even staggering by means of “filters” on the binding energy. The simplest 
of these filters is an average of differences involving the binding energies of 
three neighbouring systems 

A® (IV) = ^ ^ [E{N - 1) - 2E{N) + E{N + 1)] . (9.2.1) 

For nuclei, the index v specifies the type of particle (neutron or proton) 
whose number N is involved in the differences. 

Even in the absence of pairing, the twofold Kramers degeneracy of single 
particle levels naturally leads to odd-even oscillations of For instance, 
for the one-body Hamiltonian Hg (2.3.1) (we assume here that the ener- 
gies 6p are increasing with the index p ) , one has 

A'i?\2n+1)= 0 

( 3 ) 1 

Ai\2n) =-(e„+i-e„). 



Hence, for a non interacting Hamiltonian, the filter (9.2.1) vanishes for odd 
particle numbers and gives half the single-particle spacings at even particle 
numbers. A first conclusion is that the non zero value of the filter does not 
establish the existence of pairing correlations. 

A straightforward generalization of the relation (9.2.2) to the full 
Hamiltonian (3.2.7), leads to the prescription (9.2.3) which is pictured in 
Figure 7: the filters centered at odd values determine pairing gaps while 
the difference between consecutive filters gives access to the single particle 
spacings 



A= A®(2n-k 1) 



^n+1 — 2 



A^^\2n)-A\f>{2n+l) 



( 3 ), 



(9.2.3) 



As shown by [34], it is a remarkable and somewhat surprising fact that 
that this simple prescription remains the correct one for strongly correlated 
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Fig. 7. From [34] Principle of the analysis of the odd-even staggering of the nuclear 
binding energies according to equation (9.2.3). The particle number corresponds 
either to neutron or proton. 



systems such as the nuclei. On the other hand, it cannot be expected to 
be valid for any system and any Hamiltonian. Indeed, in general, because 
particles are interacting between themselves or with partners (proton and 
neutrons in nuclei) or with the background (interaction with surface for the 
clusters), one should expect a deviation from the relation (9.2.3) even when 
pairing forces are absent. An example is provided by the metallic clusters 
where there are good reasons to believe that the pairing mechanism is either 
not present or of little importance and for which the odd and even centered 
filters have approximately the same absolute value (Ref. [33]). 

Reference [34] analyzes separately the contributions to the filters of the 
several components of the total energy of a nucleus. One part of this 
work relies on the decomposition of the Hartree-Fock energy proposed by 
Strutinsky [36]. According to this author, the total energy can first be 
written as the sum of the pairing energy and a mean-field energy. The 
latter is again split into three contributions associated respectively with 
i) a one-body Hamiltonian similar to Hq (2.3.1) (although it is in principle 
self consistently defined), ii) a single-particle smooth background and iii) a 
liquid drop bulk energy. Then, from the well established knowledge of the 
liquid drop and the single particle densities, the authors of [34] show that the 
contributions to the filters of the last two terms cancel each other almost ex- 
actly. They conclude that, effectively, the only non vanishing contributions 
to the filter (9.2.1) come from the one-body and the pairing contribution. 
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For nuclei, it is therefore justified to analyze data by means of the simple 
prescription (9.2.3). 

Entering the details of this analysis goes beyond the scope of these notes. 
Moreover, Ref. [34] gives another more direct justification of the prescrip- 
tion (9.2.3) based on a numerical instead of a theoretical analysis. Figure 8 
displays the filter A(/ calculated from numerical Hartree-Fock (HF ) ener- 
gies computed with the effective interaction Sill [37]. This force has been 
tested with success on the nuclear energies over the entire chart of isotopes 
(that is for binding energies varying over almost two orders of magnitude) as 
well as on many other nuclear properties including multipole deformations. 
Such HF energies take into account the contributions of the one-body and 
two-body parts of the Hamiltonian but not those due to the pairing interac- 
tion. Indeed, the HF formalism (in contrast with Hartree Fock Bogoliubov) 
automatically eliminates the pairing terms from the mean-field description. 
The figure shows that the odd filters vanish and thus confirms the anal- 
ysis performed on the smooth Strutinsky contributions. In addition, the 
authors of Ref. [34] show that the even filters give a good description of the 
Hartree-Fock single particle spacing. 

Now that the prescription (9.2.4) has been established on firm ground for 
nuclei, it can be used to analyze the experimental binding energies [38]. This 
gives an access both to the pairing gap and to the single-particle spectrum. 

In Figure 9, the black dots associated with odd centers filters A), T2n-|-1) 
provide an experimental estimate of the neutron pairing gap. One expects 
the curve to display minima at magic numbers since the reduced level den- 
sity decreases the effectiveness of the pairing Hamiltonian. This is verified 
for the neutron numbers N = 28 and 50 and (not visible on this figure) 
the numbers 82 and 126. On the other hand, there is no indication of a 
minimum at A = 20 while there is a deep minimum near A = 16. This 
result may be related with the experimentally established weakness of the 
magic number 20 which has been particularly investigated because of its 
possible implications on the astrophysical nucleosynthesis processes [39] . 

The vertical difference between the white dot curve and the black dot 
curve yields information on the neutron single particle level spacing at the 
fermi surface. One expects this difference to be large near magic nuclei. This 
is indeed the case for A = 28, 50, 82 and 126. One observes a structure at 
A = 20 which provides an evidence for an increased level spacing although 
still smaller for instance than at A = 16. 

Each point in Figure 9 is an average over several isotopes with even 
values of the proton number Z. This averaging procedure may affect the 
conclusion when the nuclear structure is locally dependent on the number 
of protons. Nevertheless, the picture which results from the present analysis 
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Fig. 8. From [34]. Odd and even centered Filters evaluated from Hartree- 
Fock binding energies with the effective interaction Sill. 

is much more convincing than that provided by the fourth order filter 

A W (A) = i [a® (n) + A(3) (N + 1)] , (9.2.4) 

which, until now, was considered to yield “cleaner” values of the gap be- 
cause it corresponds to a higher order Taylor expansion [3,31]. The above 
analysis shows that A^"^^ is a somewhat unsatisfactory quantity which mixes 
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Fig. 9. From [34]. Neutron odd and even centered filters and odd centered 
filter evaluated from experimental nuclear binding energies. 



contributions with distinct physical contents. Moreover, if one keeps in mind 
that the addition of only a few nucleons sometimes leads to a significant re- 
organization of the mean field potential and of the single particle spectra, 
one has other reasons to remain cautious when using filters which involve a 
large number of nuclei. 

In the past, results based on the filter have been used to establish a 
reference function giving the average pairing gap over the chart of isotopes: 
A ~ Ylj\fA {A = N + Z) [3]. In the light of the analysis of [34], a slower 
decrease versus A reproduces more accurately the average behaviour of the 
data. It turns out that this is consistent with theoretical findings whether 
they are estimated by means of a zero range force or obtained with the most 
refined Hartree-Fock Bogoliubov calculations [40] . 

To conclude this section, one may say that the nuclear physics com- 
munity can be grateful to the cluster physics community; not so much be- 
cause of the somewhat controversial conclusions on nuclear pairing that 
some members of the latter community infered from the findings in their 
field, but because the wave they generated has led nuclear physicists to re- 
assess what seemed to be a firmly established corpus of knowledge. Scientific 
progress travels on unexpected routes. 
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10 Extensions to very small systems 

In Sections 4.4 and 7 we have discussed number particle projection at 
zero and finite temperatures. Although the formalism is written for the 
simple Hamiltonian (2.3.1) and (3.2.7) selected for these lectures, an ex- 
tension to more general interactions and to more complex description of 
the Hartree-Fock-Bogoliubov type for the the wave-function at T = 0 or 
density operator is feasible. It has been tested successfully against data for 
nuclei at low and high spins [21]. On the other hand, an Hamiltonian very 
close to (3.2.7) can be studied exactly at T = 0. Moreover, for zero and fi- 
nite temperatures, refined approximations taking into account not only the 
fluctuations of the gauge angle (p by means of the projection Pnq: but also 
the fluctuations of the magnitude of the gap A can be introduced. Recently, 
these methods have been used to attack the problem set by Anderson ques- 
tion [44-50]. The following sections contain a succint description of those 
works and comment some of their results. 



10.1 Zero temperature 

From now on, we consider the schematic Hamiltonian 

H = Ho- GPt p ^ = Y^ 44> (10.1.1) 

p 

where Hq is defined as in (2.3.1) and G is a positive constant. (The main 
difference between (10.1.1) and (3.2.7), is that in the latter we chose the 
matrix elements Gpp to be zero. This ensures that there is no contribu- 
tion of the two-body interaction to the average particle potential (no state 
dependent shift of the single particle energies Cp) and it leads to shorter 
formulas) . 

In the Hamiltonian (10.1.1), the two-body potential scatters pairs from 
any fully occupied level p to any fully empty level q, while it leaves half 
filled levels unchanged. In order to expand exact solutions with a particle 
number equal to Nq, it is therefore convenient to label the basis states by 
the number of unpaired particles Au and the list of levels [rij] which are 
partially filled: 



Np 

IK']iVuK]iVp) = (10.1.2) 

j=i i=i 

Here fij specifies which of the two states nj or fij is occupied. The indices 
[rrij] characterize the fully occupied single particle states. The collections 
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of indices [nj] and [rrii] must not overlap. The quantities fVu and A^p are 
connected to the particle number by the relation 



fVu + 2iVp = No. (10.1.3) 

Since the subspaces of the Hilbert associated with distinct collections of 
indices fij are unconnected by the Hamiltonian, fVu (or fVp) is an invariant 
which can be used to sort the eigenstates into families of solutions. For a 
given set of indices fij the solution of the problem amounts to the diago- 
nalization of a matrix with dimension . This matrix is sparse since, 

apart from the diagonal, only the matrix elements between states differing 
by at most one index in their collections of indices rrij are connected by 
the two-body force. On a given line of the matrix, the number of non zero 
matrix elements (all equal to —G) is A^p x (O — A^„ — A^p)- This number is 
thus much smaller than the dimension of the matrix. 

At T = 0, for negative values of G the ground state of an even system 
will always be in the subspace of dimension corresponding to states 

\[mj\Ng/ 2 ). For an odd system, several (at most O) calculations may be 
necessary in subspaces of dimension ((jv^~i)/ 2 ) ^^er selection of an unpaired 
level occupied by only one particle. In practice, fewer calculations suffice, 
since the optimal state to be blocked must be in the vicinity of the Fermi 
level. For studies at finite temperature, in principle, one has to diagonalize 
the Hamiltonian in all the subspaces compatible with the relation (10.1.3). 
The canonical thermal averages of operators are then obtained by sums 
with Boltzmann weight factors. At low temperature, it is not necessary to 
diagonalize H in the Hilbert subspaces such that the energy of the lowest 
eigenstate is much larger than the temperature. 

Although for large values of and No the problem remains of consider- 
able size, it is within reach of today’s computational means for dimensions 
compatible with systems of interest (from one digit to three-digit num- 
bers of interacting particles and levels). Note that, in shell model calcu- 
lations of nuclear structure, one now routinely solves diagonalization prob- 
lems involving much more complex Hamiltonians than (10.1.1) and matrices 
with dimensions exceeding 10^® [51]. As a matter of fact, for the simple 
Hamiltonian (10.1.1), a remarkable analysis of the problem by Richardson, 
based on a quasi boson representation, has shown that the determination 
of the ground state energy and wave-function in every subspace can be ef- 
fected by means of a coupled set of — A^u non linear equations [52] . Using 
one of the aforementioned numerical methods, in recent years, the T = 0 
problem has been thorougly discussed by several groups when Ho corre- 
sponds to a regularly spaced single particle spectrum (picket fence) [44-48] . 
Some results extracted from [47], in relation with the Anderson’s question 
mentioned in the Introduction, are presented in Figure ??. 
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Fig. 10. From [47]. Comparison between exact and several approximate results 
at T = 0 for the condensation energy and the odd-even binding energy staggering. 
The Hamiltonian is given in equation (10.1.1) (see text). 



Since there are several ways to define the limit of a small superconducting 
system, we indicate here the choice made in [47]. We have seen in Section 2.1 
that, in first approximation, the volume of the system is inversely propor- 
tional to the distance d= 1/w between single particle levels. According to 
Anderson’s suggestion [12], one would like to investigate the region where 
d becomes equal or larger than some quantity measuring the strength of 
pairing correlations. Reference [47] defines this quantity as the continuous 
spectrum pairing gap solution of equation (3.4.3) A = A/[2sinh(l/rcG)] (in 
the figure, the notation is A) . A is kept constant as d = 1/w varies by keep- 
ing both A and wG unchanged {wG = .224). Thus, as the abscissa x = d/A 
increases, so does G. In contrast, the density w, the volume and the number 
of particles N = wA decrease while the density of fermions involved in the 
pairing phenomenon remains constant. The last equation implies that the 
Fermi level does not move within the single particle spectrum. In [47] it is 
assumed to remain approximately at mid spectrum. 

The range of abscissae investigated correspond to a rather stretched 
scale. For macroscopic and even microscopic superconducting systems the 
value of X is almost zero. The case investigated numerically in Section 8.5 
corresponds to a: = l/(wA) = 0.1 and, for nuclei, the value of x varies 
between 0.2 to 0.4 depending on the difference between the values of N or 
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Z and a magic number. It is not yet clear which values of x are reached in 
nanoscopic grains. 

The upper part of the figure displays the difference between the ground 
state energy and the first order perturbation corrected energy (i.e. the ex- 
pectation value of the full Hamiltonian (10.1.1) in the ground state of Hq)- 
One sees that the odd and even exact curves are well reproduced by all 
pairing approximations when x is less than one. For values of the abscissa 
larger than two, the projected approximations are of good quality while 
standard BCS underestimates the fluctuation energy. For a; « 1, none of 
the approximate methods can be said to work very well. 

In the lower part of the figure, the filter Al,’{2n + 1). (In Fig. ??, 
A^^^(2n-|-1) is denoted is plotted as a function of x. For a; < 1, the dot 
curve corresponds to the approximation a 1.^^/A « 1 — l/(2a:) = A_(0)/A 
given by equation (8.4.11). For these small values of d/A, BCS (blocked 
BCS is used for odd systems) and all other approximations reproduce well 
the exact results. When x is larger than one, the BCS result approaches 
the value G/2 which correspond to the filter aI.^^ calculated from ground 
state energies evaluated by means of a first order perturbation method. This 
corresponds to a situation where pairing correlations have vanished. The de- 
viation from the exact result is significant. For large values of x, the particle 
number projected BCS method provides a reasonably good approximation. 
On the other hand, although this method predicts correctly the zone of 
the transition between the two regimes, the quality of the quantitative es- 
timate of the filter is modest for a; « 1. The results obtained by solving the 
Richardson equations (labeled “exact”) are reproduced accurately by the 
numerical diagonalization (open circles) as well as by the density matrix 
renormalization [46] to the point that the curves almost undistinguishable. 

From Figure ??, one can draw the conclusion that under the conditions 
selected for this calculation there exist two distinct dynamical regimes. On 
the other hand, as d/A increases from 0.5 to 2, the transition between them 
is very smooth. 

As mentioned above, it is still not clear whether there exist a set of 
systems which would allow us to probe the evolution displayed in the lower 
part of Figure ??. Note also that because the model requires that the 
strength G grows as the inverse of the single particle energy density, no 
part of the curve can be said to correspond to a perturbation regime. 

The results we have just discussed have been obtained for a picket fence 
single particle spectrum. Recently, additional information has been col- 
lected which, in metallic grains, suggests a more complex structure with 
some evidences for spin orbit structure as in atoms and nuclei [53,54]. In 
addition, the production technique of the grains does not warrant a regu- 
lar shape for the grains. For this reason, several groups have investigated 
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the transition problem assuming randomly distributed single particle spac- 
ings [47,55,56]. 

Another approach to the same problem is the Generator Coordinate 
Method (GCM) [18,57,58]. The approximate wave function is constructed 
as a linear superposition 

14/)=/ dG/(G)P^jBCS(G)), (10.1.4) 

Jo 

where the state PArg|BCS(G)) is the projected BCS solution calculated with 
the auxiliary Hamiltonian 



H = Ho-GP'<P, (0<G<oo). (10.1.5) 

The unknown function /(G) which minimizes the energy E calculated with 
the Hamiltonian (10.1.1) 



S_E_ (£^ 

Sf ’ (4/|4/) 



( 10 . 1 . 6 ) 



is solution of the Hill Wheeler integral equation. Thanks to a general- 
ized Wick theorem [59], the calculation of the kernels entering the inte- 
gral equation can be evaluated for Hamiltonians much more sophisticated 
than (10.1.1) [60]. For H given by (10.1.1), a comparison with exact calcula- 
tions [61] shows that the GCM method provides a very accurate description 
of the exact ground state and first excited states. Indeed, it can be shown 
that the overlaps of the exact and the GCM many-body wave functions |4/) 
differ from unity by very small amounts (typically less than 10“^). 



10.2 Finite temperatures 

One of the result of [47] is that, at T = 0, as the size of the system decreases, 
there is a smooth transition between two regimes. A similar prediction for 
the temperature dependence of pairing effects in small system had been 
made as early as 1972 [62]. In this section, we describe a recent work, which 
reaches the same conclusion by means of a fully microscopic analysis [49, 50] . 
First we note that by introducing the quantities 

G 

v,=v, = - 

€p — Cp 2 ^ ’ ^0 — ^ ^ ^.pi^CtpQp (Zp(Zp) 

P 



( 10 . 2 . 1 ) 
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Fig. 11. From [50] Temperature evolution of the pairing gap equivalent quan- 
tity (10.2.9) for an exact calculation and several approximations for an odd sys- 
tem. The letters a to d refer to decreasing pairing intensities (see text). For the 
intensities c and d, the (number parity projected) BCS gap vanish identically. 



up to a c- number, the Hamiltonian 10.1.1 can be put into the following form 

^ i/=l,2 



We now introduce the r-dependent (0 < r < /3) auxiliary fields {x,^} and 
the family of one-body Hamiltonians 



H{{x{r)}) 



Ho+ E 



Xi,{t)'^ 

2Vy 




(10.2.3) 



Then, by means of Hubbard-Stratonovitch transformation [63], the exact 
number parity projected partition function can be expressed as a func- 
tional integral over the fields {x^} 



Zrj = Tr Pr, 



-PH ^ 



= / D[x] TrP^Te 



- / drH({a;(r)}) 



(10.2.4) 
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in which is the number parity projection (8.1.1), T stands for the time 
ordering operator and the measure D[x\ is such that 





dr 



Xu{tY 

2v„ 



= 1 . 



(10.2.5) 



The formulation (10.2.4) is the starting point of Monte-Carlo methods which 
have been used, for instance, to calculate nuclear level densities relevant for 
astrophysical nucleosynthesis [64] . 

Reference [50] transforms (10.2.4) in order to implement the Correlated 
Static Path Approximation (CSPA) approximation [65-72]. This transfor- 
mation involves a Fourier expansion of the fluctuating fields x^{t) 

x^(r) = 4°) -h ^ = ^. (10.2.6) 

n^O ^ 



From this expansion, an expression of the partition function equivalent 
to (10.2.4) can be obtained 



Zv= I U d[x(°)] n d[x(")] 

i/=l,2, n^O 



u=l,2 



Tr PjjTe 



with 









!^= 1 , 2 , n^O 



2Vu 



(10.2.7) 




( 10 . 2 . 8 ) 

Within the CSPA, the fluctuations associated with the static fields are 

(n) 

exactly taken into account by integration, while for the other fields xi, 
the integral is performed in the Gaussian approximation. The CSPA is 
therefore essentially exact at high temperature (small (3 values). On the 
other hand, because it also accounts for the quadratic fluctuations of the 
fields compnents xi, ^ over the interval 0 < r < /3, its domain of validity can 
extend to rather small temperatures. 

Among several other results, reference [50] presents a numerical compar- 
ison of CSPA and exact results for a picket fence model involving ten single 
particle levels with a spacing e = 1/w and ten or eleven particles. The re- 
sults displayed in Figures 11 and 12 have been calculated for several values 
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Fig. 12. From [50] Temperature evolution of the pairing gap equivalent quan- 
tity (10.2.9) for an exact calculation and several approximations for an even sys- 
tem. The symbols for the curves are the same as in Figure 11. The letters a to d 
refer to decreasing pairing intensities (see text). For the intensity d, the (number 
parity projected) BCS gaps vanishes identically. 



of the strength G selected to emulate small paired systems. In the figures, 
they correspond to the labels a {wG = 0.55), b {wG = 0.45), c {wG = 0.35) 
and d {wG = 0.2), respectively. Since the pairing gap A is defined only 
within the mean-field methods, the authors introduce the quantity 

Ap = G^{P^P)^G)-{P^P)iG=0) (10.2.9) 

which can be evaluated for any method. It is a measure of the intensity of 
pairing correlations which can be shown to reduce to the BCS gap A in the 
BCS approximation. 

The figures show an excellent agreement between exact and CSPA calcu- 
lations down to small temperatures. The curves do not display any sign of a 
phase transition. Both in the odd and even cases and for the four strengthes. 
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one only observes a gentle decrease of Ap(T) as the temperature exceeds 
the BCS critical temperature (« 0.56Ap(0)). 

Two other sets of curves have been plotted. One of them corresponds 
to the Static Path Approximation (SPA). In this approximation, the con- 
tribution of the Fourier components n yf 0 of the fields are neglected. 
As expected, SPA gives a reasonable agreement at high temperature and 
displays the same smooth evolution as CSPA. At zero temperature, for even 
systems (and not too small values of G), it joins the number parity projected 
BCS curve (BCS). This latter curve is systematically below the exact (and 
CSPA) results. It also displays the transition typical of the mean field meth- 
ods which turns out to be unphysical in such very small systems, i.e. for 
values of wA of the order or smaller than unity. It will be interesting to 
apply SPA and CSPA to larger systems in order to determine the limit of 
validity of the number-parity projected (or particle-number projected) BCS 
method. 

11 Conclusions and perspectives 

From this short review, I hope to have convinced you that, if not tomorrow, 
at least very soon, we will have at our disposal a set of methods appropriate 
for handling pairing Hamiltonians for condensed matter systems of any size 
from the microscopic to the nanoscopic domain and for nuclei as well. 

In this lecture, I have also tried to stress that, when finite systems are 
considered, it is important to restore the symmetry which is broken at the 
level of the mean field approximation (BCS). The order parameter A is 
characterized both by a phase tp and an amplitude. Restoration of particle 
number symmetry is achieved by means of an integration over the phase 
to ensure that the particle number is exactly (and not on the average) 
equal to Nq. A useful first approximation to this projection for systems 
of microscopic size (and also for nuclei) consists in the projection on the 
particle number parity. 

On the other hand, for very small system {wA < 1) the fluctuations 
of the amplitude |A| of the gap have also to be taken into account. For 
Hamiltonians whose complexity precludes an exact solution (in contrast 
with H given by Eq. (10.1.1)), the Generator Coordinate Method (at T = 0) 
or path integral methods such as the CSPA appear to have an interesting 
potential. 

Therefore, the unsolved problems ahead of us do not appear to belong to 
the realm of methods and solution techniques but rather to that of the pair- 
ing dynamics. This is a question that this lecture has carefully avoided to 
address. The excuse I gave you was that, beyond the schematic Hamiltonian 
used in these notes, any attempt to introduce a more realistic description 
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of the pairing interaction would be specific of either the nuclear physics or 
the condensed matter physics domain and therefore not fit the goals of this 
school (especially if one takes into account that my knowledge of the ques- 
tion is restricted to nuclear aspects). It is my conviction that it is indeed in 
the case. I do not see why a phonon mediated attraction between screened 
electrons relates to the multipole structure of an interaction between nu- 
cleons acting in different isospin channels. The most important question 
is how each field can effectively acquire a better knowledge of the relevant 
pairing Hamiltonian. Traditionally, two lines of attack can be considered. 

The first is of a phenomenological nature. It relies on the confrontation of 
theoretical results with a sufficiently large body of data. In nuclear physics, 
the availability of segmented gamma ray detectors has already allowed a very 
precise determination of many rotational bands. Several groups in Europe 
and in North America are competing to extract and gather these data. Thus, 
significant improvement of our knowledge of rotational structures can be 
expected in a near future. From it, and in particular from the dependence of 
the moments of inertia on both angular momentum and the single particle 
structure, one can hope to obtain enough information to constrain and 
characterize the pairing Hamiltonian. In condensed matter physics, the 
corresponding approach would rely on an analysis of magnetic properties 
od small superconducting grains. However, although the feasibility of such 
studies has already been demonstrated, the situation does appear as mature 
for small superconducting grains as it is for nuclei. The competence in 
making these grains and studying them is still very much confined. In 
addition, while a nucleus is a reproducible object, no two metallic grains 
are identical. Sorting out pairing contributions from dynamical information 
associated with a mean field which varies randomly from one experiment to 
the next may be a difficult task despite the ingenuity displayed by both the 
experimentalists and the theoreticians of the field. 

In another approach, one could try to determine the Hamiltonian from 
first (or at least prior) principles and a better understanding of the mi- 
croscopic mechanisms which generate pair attraction. My opinion is that 
the possibility for this to happen in nuclear physics, is small at least in 
a near future. Indeed, there is still a large theory gap between Quantum 
Chromodynamics or even Boson Exchange Theories and the effective nu- 
cleon interaction which determines the structure of nuclei. In addition, the 
size of the community working on the subject is rather limited. Although 
(or perhaps because) it is not my domain, I think that the situation is 
more favorable in condensed matter physics. Indeed, one can hope that 
the important theoretical effort presently devoted to explaining the mecha- 
nisms underlying high Tc superconductivity in compound crystals will yield 
some quantitative improvements of the overall understanding of the pairing 
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mechanisms which may turn out to be relevant for small metallic systems 
as well. 

To conclude I would like to return to the question asked by Anderson 
more than forty years ago [12]. In his article in J. Phys. Chem. Solids, 
one could read “An interesting question is [at] what size of particles and 
at what degree of scattering will superconductivity actually cease” . I have 
the feeling that over forty years most the tools necessary to solve problems 
related with the “size” aspect have been developped. On the other hand, 
there is still some important work ahead of us to understand that part of 
the question which is related to the “degree of scattering” . 



I am indebted to R. Balian and M. Veneroni for an enjoyable collaboration of many 
years and for their steady support. I also want to express my thanks to J. Dobaczewski, 
J. Dukelski, P.H. Heenen, P. Ring, R. Rossignoli and W. Satula for useful discussions 
and for providing some of the figures which illustrate this lecture. 
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Abstract 

The electronic structure of simple metal clusters and quantum dots 
is studied on the basis of the density functional theory and simple 
models. It is demonstrated that single-particle models explain well 
the gross features of deformation and magnetism in small clusters, 
nuclei and quantum dots and that the local density approximation 
can give valuable information of the internal structure of the many- 
body state. 

1 Introduction 

The observation of magic numbers in alkali metal clusters [1] started an 
extensive study of their electronic properties both by using simple models, 
like the jellium model [2] , and by performing accurate ab inition electronic 
structure calculations [3]. The same happened earlier in the case of atomic 
nuclei and more recently in the case of semiconductor quantum dots. Many 
physicists have worked on all these finite fermion systems. 

The purpose of these lectures is to show that many key properties can 
be most easily undestood by using simple models. Ab initio many-body 
calculations are definitely necessary but the best understanding is often 
obtained by comparing their results with those obtained by simple models. 

The density functional Kohn-Sham method [4] gives a formal way to 
transform the many-body problem into a single-particle problem. Most of 
the models in these lectures are based on the Kohn-Sham formalism. Special 
emphasis is put on the symmetry broken solutions and their relevance in 
giving information of the internal structure of the many-body wave function. 

2 Jellium model and the density functional theory 

The first model to explain the electronic shell structure in metal clusters 
was the jellium model. Before going to the applications of this model to 

© EDP Sciences, Springer- Verlag 2001 




300 



Atomic Clusters and Nanoparticles 



clusters it is interesting to review the use of this model in studying surfaces 
and defects in bulk metals. The background of the jellium model is the fact 
that many properties of simple metals can be understood by studying a ho- 
mogeneous electron gas, or nearly free electrons [5]. The band structure can 
be described with weak pseudopotentials: the electron energy levels form 
free electron energy bands, which are only slightly deformed close to the 
Brillouin zone boundaries. At the boundary an energy gap is opened. The 
size of this gap is 2|Vg|, i.e. twice the value of the pseudopotential compo- 
nent corresponding to the reciprocal lattice vector G. In alkali metals, with 
only one valence electron per atom, the Fermi surface fits well inside the 
first Brillouin zone and is very accurately spherical, as has been observed 
using several different methods. 

The cohesive energy of simple metals can be approximated using model 
pseudopotentials and assuming the electron density to be homogeneous. In 
calculating the total energy, however, it is important to evaluate the energy 
of the interacting electron gas. The use of interacting electron gas is essential 
also for understanding collective excitations. The jellium model means a 
homogeneous electron gas with a compensating homogeneous background 
charge. The physical approximation is that the charge of the positive ions 
is replaced with a homogeneous positive charge. 

A metal surface in the jellium model is descibed by cutting off the back- 
ground charge from the vacuum site of the surface. Usually this is done 
with a discontinuous step function 

n+{z) = 9{z)no, (2.1) 

where n+ is the positive background charge, 9 a step function and no the 
density of the homogeneous electron gas. The density is described by a 
density parameter Vg (no = S/dyrr^ag, uq being the Bohr radius), which 
varies between 2 and 6 for normal metals. In a similar fashion a vacancy is 
described by a spherical hole in the background density and a metal cluster 
by a homogeneous sphere of the background charge. 

In order to describe adequately the interacting electron system the den- 
sity functional Kohn-Sham method [4] can be used. We do not go in the 
details of the density functional method but only mention here the key equa- 
tions needed (for details see [6]). The total electron density is described as 
a sum over the occupied single-electron states, separately for each spin com- 
ponent 

n(r) = n|(r) -hnj.(r), (2.2) 

where 

occ 

na{r) = 

i 



(2.3) 
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The single particle states are self-consistent solutions of the Schrodinger 
equation 



-^VV*<r(r) + (2.4) 

where Veff is an effective potential consisting of the Coulomb potential and 
of the so-called exchange-correlation potential. In the local spin density ap- 
proximation (LSD A) the exchange-correlation potential depends locally on 
the electron spin density and can be calculated using existing interpolation 
formulas [6-8] . In cases where the system does not have a spin-polarization 
the local-density approximation (LDA) can be used. In this case the role of 
the spin is simply to add a degeneracy two to each single particle state. 

The surface of simple metals in the jellium model was studied first by 
Lang and Kohn [9, 10]. They solved the self-consistent electron density, 
surface energy and work function as a function of the electron density of 
the metal. While the mere jellium model gave good results for alkali metals 
it failed in high-density metals, e.g. by resulting to a negative surface energy. 
This drawback could be corrected by introducing ion pseudopotentials as a 
perturbation [10] or by taking them into account from the very beginning 
in the variational method [11]. It was observed that the average of the 
pseudopotential in the bulk was the key incredient to correct surface energies 
[12]. The deficiency of the jellium model in describing energetics was caused 
by the fact that in the bulk the rigid positive background stabilized the 
electron gas, but at the surface it could relax to a density closer to the 
minimun energy density of the homogeneous gas (coresponding to Vg « 4.2). 
Later, so-called stabilized jellium model [13, 14] was introduced to account 
the problem of the average pseudopotential (it was later applied also for 
clusters [15]). 

The jellium model has also been used to study spherical defects inside 
the metal. A vacancy or a spherical void is then described as a spherical 
hole in the background density [12,16]. The Kohn-Sham equations have 
been solved self-consistently and the results can be related to those of plane 
surfaces. However, the void formation energy could not be well described 
with pseudopotentials. The reason is that in large spherical voids the void 
surface (cut in a regular lattice) becomes very rough when the void size 
increases, causing the Coulomb energy eventually to diverge [12]. Similar 
problems are expected in the case when the jellium model is applied to 
planar surfaces with large Miller indexes (say [710]-surface). It is interesting 
to note that the same problem was met in the first application of the jellium 
model to metal clusters [17]. 
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The electron density close to a planar surface has slowly decaying Friedel 
oscillations 



, . cos(2fcp^:) 

n{z) — no (X ; 

kpz 



(2.5) 



where z is the distance from the surface and kp the Fermi wave vector. 
Similar oscillations will result around point defects in metals. In the linear 
screening theory they can be understood as arising from the logarithmic 
divergence of the derivative of the dielectric response function. In the Kohn- 
Sham theory they are a result of the phase shifts of the single particle wave 
functions: Far away from a defect, say vacancy, the radial wave functions 
are 



Ek£ oc 



cos(kr + 6({k)) 



where the scattering phase shifts obey the Friedel sum rule 



Siikp) = / d^r(n(r) - no) = 

e 



( 2 . 6 ) 



(2.7) 



where n is the electron density around the vacancy and no the bulk density, 
and Z the total charge removed with the background density at the vacancy. 
Related Friedel oscillations will allways exist at cluster surfaces. 

The applications of the jellium model to surfaces and defects are inher- 
ently scattering problems. There is a continuous spectrum of energy levels 
from zero to the Fermi level. Consequently, the electronic properties of 
spherical voids in bulk (jellium) are smooth functions of the size. There are 
no magic sizes of vacancy clusters caused by the electronic structure. The 
vacancy clusters have scattering resonances but since in a large void the 
repulsive effective potential is higher than the Fermi level, sharp resonances 
can only occur above the Fermi level and they do not directly affect on 
the stability of the ground state. In fact similar scattering resonances will 
accur also in spherical jellium clusters and have to be taken into account 
in calculating for example the photoionization cross section [18]. Note that 
the Friedel sum rule also applies for the scattering phase shifts in the case 
of a jellium cluster. 



3 Spherical jellium clusters 

A spherical jellium cluster consists of N interacting electrons moving in an 
external potential of a homogeneously charged sphere (positive background 
charge). The electrostatic potential energy of an electron in this background 
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charge is 



V{r) 



3e^Z 



SneoR^ 

e^Z 



Ancor ’ 




if r < i? 
if r > 0, 



(3.1) 



where R is the cluster radius and Z the total charge of the ionic background. 
For neutral clusters Z = N and R = Note that inside the jellium 

sphere the potential is harmonic. In a harmonic potential the center of mass 
motion separates out from the internal motion of the electrons. In large 
clusters the electrons are mainly inside the harmonic well and consequently 
the center of mass motion is the dominating collective resonance, i.e. the 
plasmon [2,19]. In the limit of larce clusters the resonance frequency is then 
independent of the electron-electron interactions. However, in studying the 
single particle motion the electron-electron interaction has to be taken into 
account. In the Kohn-Sham method this means that the electrons move 
in an effective potential (see above) which also include the electrostatic 
potential of the electrons and the exchange-correlation contributions. In a 
neutral jellium cluster the inside of the cluster will be electrically neutral 
and consequently the effective potential will be nearly constant inside the 
cluster and the potential resembles more a potential well with a rounded 
edge than a harmonic oscillator potential. The effective potential can be 
approximated with the Woods-Saxon potential 



V{r) = 



Vo 

1 -h e^V-R) ’ 



(3.2) 



where Vq is the depth of the potential and a a parameter determining the 
softness of the surface. Physically this softness comes from spilling out of 
electrons from the jellium sphere. There are several other ways to construct 
a model potential for stydying the electronic structure in spherical clusters. 
Mansikka-aho et al. [20] suggested the form 



V{r) 



Vo 

1 -k (r/i?)“ 



(3.3) 



The advantage of this potential is that it can be smoothly transformed from 
a square well (a ^ 0) to a potential with a Coulomb tail (a = 1). 

The self-consistent solution of the Kohn-Sham equations for jellium clus- 
ters can be found only numerically. First this was done by Martins et al. [17] 
using a parametrized potential and later fully self-consistently by many oth- 
ers [21-24]. The results for the electronic structure, ionization potential and 
stability are not very sensitive to the degree of the self-consistency and even 
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Fig. 1. Total energy per electron as a function of the number of electrons of a 
spherical jellium cluster (cg = 4). The solid line is the LDA result and The dashed 
line the LSDA results showing the effect of the Hund’s rule. 

the simple model potentials produce very well the essential features of the 
electronic shell structure. 

Figure 1 shows the total energy per electron of a jellium cluster and 
Figure 2 its second derivative A 2 E = E{N — 1) — 2E{N) + E{N + 1). The 
magic numbers 2, 8, 20, 34, and 40 are seen in the total energy of the LDA 
as minima and in the second derivative as maxima. The open shell clusters 
represent degenerate ground states. In these cases, the Jahn-Teller theorem 
dictates that if there is a mechanism to remove the degeneracy, the total 
energy can be decreased. In metal clusters the shape deformation is the 
most important mechanism (it will be discussed in the following sections). 
However, there is another mechanism to reduce the degeneracy. This is 
the first Hund’s rule which suggests that in an open shell the z-component 
of the spin will be at maximum. In the spin-density functional formalism 
(LSDA) this means that the single particle energy levels of the majority spin 
will be pulled down and those of the minority spins will be pushed up, thus 
reducing the degeneracy. In the case of a half-filled shell the degeneracy 
totally disappears since the spin-up electron shell will be full and the spin- 
down electron shell empty. Indeed, if we use LSDA in the spherical jellium 
model the A 2 E will have clear maxima also at half- filled shells as shown in 
Figure 2. In Figure I we see that for the nonmagic clusters the LDSA gives 
lower total energies than the LDA. 
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Fig. 2. The second derivative of the total energy as a function of the number of 
electrons of the spherical jellium cluster. The solid line is the LDA result and the 
dashed line the LSDA results showing the effect of the Hund’s rule. 



In the case of spherical clusters the open electron shell would also mean 
a large magnetic moment associated with the total spin, as first pointed 
out by Geguzin [25] for a pseudopotential model and by Hintermann and 
Manninen for jellium clusters [21]. However, later experiments and theo- 
retical considerations have shown that the shape deformation is much more 
effective in reducing the degeneracy of the open electronic shells. There is 
no evidence of magnetism of free alkali metal clusters. Nevertheless, there 
are suggestions that clusters confined in spherical cavities in zeolite could 
prevent the deformation and lead to magnetism according to the Hund’s 
rule. 



4 Effect of the lattice 

So far we have assumed that the electrons form an interacting electron gas 
but that they do not feel the underlaying lattice. The lattice can be taken 
into accout by using perturbation theory and proper pseudopotentials. It 
is well known from the solid state physics that model pseudopotentials can 
reproduce the band structure of crystalline metals as well as describe their 
cohesive erengy. Consider first the effect of the pseudopotentials on the shell 
structure of alkali metal clusters. The band structure theory for nearly free 
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electron metals shows that outside the immediate vicinity of the Brillouin 
zone boundaries the effect of the pseudopotentials come from the second 
order perturbation theory 



ek - Ck - X! 

G 






— e 



0 

k-G 



(4.1) 



where Ug is the Fourier component of the pseudopotential, G a reciprocal 
lattice vector and j2m. This equation gives us an estimate of 

the crystal field spitting caused by the pseudopotential lattice. When this 
energy difference becomes comparable to the energy gap between adjacent 
shells of the spherical jellium model the shell structure effects should disap- 
pear. The energy gaps in spherical jellium clusters can be estimated from 
the energy levels in a spherical cavity. They are determined by the zeroes 
of spherical bessel functions and give the following estimate for the energy 
gap between major shells [26] 



where N is the number of electrons and ep the fermi energy. The energy gap 
will be of the same order than the crystal field splitting when the cluster 
has of the oder of 10® valence electrons. This number is far larger than the 
size region where the shell structure or supershell structure is observed. In 
fact, it turns out that the surface effects cause much stronger disturbance 
to the shell structure than the pseudopotential lattice inside the cluster. 

Three different properties of the surface have effects on the electronic 
shell structure: softness of the surface potential, surface roughness, and 
faceting. The softness can be described by a model potentials (9) or (10) 
and has found to have rather small effects on the shell structure of small 
clusters. Only extreme softness (which will prevent classical triangular or- 
bits of electrons) can drastically change the shell structure [20,27]: magic 
numbers would still exist but with different electron numbers. 

The surface roughness is related to the crystal field splitting caused by 
the pseudopotentials inside the metal. However, since the periodicity of 
the lattice is broken at the surface the effect is much stronger. Several 
different models [28-32] have been applied for studying the effect of the 
surface roughness. (Simplest way to study it is to use the tight binding 
model described in the next section.) The roughness widens the discrete 
degenerate electron shells and make the density of state smoother. The 
shell structure is still there but it is weaker. The surface roughness also 
increases the total energy of the cluster (this is the reason why a simple 
pseudopotential correction to the spherical jellium model fails). 
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Fig. 3. Energy levels in a potential well corresponding to a 309 atom sodium 
cluster. The long lines are results for a spherical well, the short lines on the 
left-hand side are results for a cuboctahedral well and the short lines on the right- 
hand side are results for an icosahedral well. For a nonspherical cluster the angular 
momentum is only approximately correct. From reference [37]. 



The surface of a crystalline material tends to be faceted, consisting of 
only low-index facets (in FCC lattice [111], [HO], [100]). In finite clusters the 
minimization of the surface energy leads to so-called Wulff construction [33] 
which is a regular polyhedron. In small clusters also noncrystalline faceted 
clusters like icosahedra and decahedra are possible. Indeed, experiments 
have shown that large sodium clusters are icosahedra [34] while aluminium 
clusters seem to have octahedral shapes [35] . Generally, it seems that when 
the surface faceting determines the geometry, the electronic shell structure 
does not any more have a marked effect of the cluster stability. It is then 
impontant to know the effect of the faceting to the electronic shell structure. 
Mansikka-aho et al. [36, 37] have used two different models to study the 
effect of the icosahedral shape and Wulff construction to the electronic shell 
structure. The results show that the Wulff construction in in an FCC lattice 
destroys the shell structure when the cluster has about 300 electrons while 
the icosahedral shape still has the shell structure when the cluster has about 
1000 electrons. Figure 3 shows the effect of the surface faceting on the 
electronic structure in the cases of icosahedral and cuboctahedral shapes. 
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5 Tight-binding model 

Before studying further the effect of the surface and deformation on the 
electronic structure we will introduce another simple model. The tight 
binding model is often considered of being an opposite to the nearly free 
electron model. While in the latter the basis functions are delocalized plane 
waves, in the tight binding model one starts from localized orbitals. The 
simplest tight binding model assumes only one s-electron per atom, neglects 
the differential overlap between orbitals and assumes coupling only between 
nearest neighbours. This model is frequently called the Hiickel model or the 
CNDO model (complete neglect of differential overlap). 

The hamiltonian to be diagonalized is 

{ a if i = j 

—(3 if i and j nearest neighbours (5.1) 

0 otherwise 

where a is the energy eigenvalue at a free atom and [3 so-called hopping 
integral between nearest neighbours. Without restrictions we can use Hiickel 
units by taking a = 0 and (3=1. With this simple Hamiltonian it is easy 
to solve the energy eigenvalues for infinite lattices with periodic boundary 
conditions or for finite clusters with up to tens of thousands of atoms. The 
total energy of the simple tight binding model can be defined as the sum of 
the occupied energy levels. In determining the geometry the bond length is 
usually kept fixed and the atoms are not allowed to be closer to each other 
than the bond length (hard spheres). 

Even if the model is so simple, it has turned out to be extremely difficult 
to find the optimum geometry (which minimizes the total energy) as soon 
as the cluster has more than 10 atoms. Wang et al. [38, 39] computed 
the ground state geometries for Hiickel clusters up to 9 atoms by testing all 
possible topologies. It is interesting to note that the ground state geometries 
obtained with the Hiickel model are very similar to those obtained with 
ab initio calculations [3,40] for alkali metals. For larger clusters simulated 
annealing technique [41] and genetic algortithms [42] have been used to 
look for the ground state geometry. However, the Hiickel model seems to 
be a much more difficult optimization problem than for example finding the 
ground state geometries for Lennard-Jones clusters [43]. 

In studying large spherical Hiickel clusters Lindsay et al. [44] observed an 
electronic shell structure, similar to that of the jellium model. These clusters 
were cutted off from cubic lattices and contained hundreds of atoms. 

For understanding these surprising results two different considerations 
have have to be made. We will first show the equivalence between the 
Hiickel model and the free electron jellium model in spherical clusters. 
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The understanding of the insensitivity of the overall shape of small clus- 
ters on the theoretical approach is slightly more complicated and will be 
discussed in the next section. Consider now the single particle Schrodinger 
equation in an infinite jellium. The potential is constant and thus the so- 
lution is a free electron plane wave. However, lets assume that we do not 
know that, but instead try to solve the equation numerically in a three- 
dimensional cubic grid with a lattice constant a. The Schrodinger equa- 
tion then becomes 






- cv^(r,) 



= eV’(ri), 



(5.2) 



where the sum goes over the nearest neighbours and c is the coordination 
number {i.e. the number of nearest neighbours). It is now obvious that 
the solution of the group of equations (5.2) is equivalent of finding the 
eigenvalues and eigenvectors of the Hamiltonian (5.1) with 



a = 



Ti^ c 
2ma^ ’ 






2ma^ 



(5.3) 



The tight binding model in a cubic grid is thus an approximation to the free 
electron model. This explains the similar shell structure at low energies. We 
should notice, however, that the “numerical” results are accurate only when 
the variation of the wave function is small within the distance a. This means 
that if a is a true lattice constant the Hiickel model is only accurate close 
to the bottom of the conduction band. Another way to state the similarity 
of the Hiickel model and the nearly free electron model is that in the case 
of an isotropic effective mass the bottom of the energy band always looks 
like a free electron band. 

The numerical solution of the Hiickel model has been used to study the 
shell structure of nonsherical clusters like icosahedra [36] , cuboctahedra [36] 
and tetrahedra [45] and effect of the surface roughness on the shell structure 
and level spacing statistics [28,29]. 



6 Shape deformation 

Computations of the atomic structure of metal clusters have shown that 
small clusters prefer shapes that generally are not closed packed structures 
of hard or soft spheres as would be a result of a pairwise interaction {e.g. 
Lennard- Jones clusters). For small alkali metal clusters the basic geome- 
tries seem to be quite insensitive to to accuracy of the treatment of the 
many-body problem of electrons and ions: ab initio quantum chemical meth- 
ods (Cl), Kohn-Sham method with pseudopotentials, and even the simple 
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Hiickel model gives quite similar planar structures for sodium clusters up 
to 5 or 6 atoms and nearly the same structures also for clusters with 7 or 8 
atoms. This suggests that there should be a simple single-particle explana- 
tion which determines the basic structures. 

Experimentally, the observation of the geometries is not straightforward 
and the information of geometries of small clusters comes only from indirect 
measurements. The photoabsorption and photoelectron spectra seem to be 
sensitive to the detailed geometry of the cluster, but both methods need 
a comparison of theoretical and experimental results for determining the 
structure [3,46]. A simple theory of photoelectron spectroscopy is shortly 
explained in Section 9. In large clusters with more than, say, 40 atoms, there 
is only very limited experimental knowledge of the cluster shapes (expect 
in some cases of large clusters where diffraction measurents have confirmed 
the clystal lattice [48], or the mass spectrum had revealed the growth pat- 
tern [34,35]). Nevertheless, the splitting of the plasmon resonance can be 
explained by an overall prolate or oblate geometry [49] and the intensity 
variation of the mass spectrum has been explained with help of deforma- 
tion [50,51]. 

In the jellium model the deformation can be understood as a Jahn-Teller 
effect, which tries to reduce the degeneracy of an open shell structure. First 
ellipsoidal deformation with constant background density was considered 
[52]. This model already gives good understanding of the basic features in 
the small cluster limit. It predicts correctly that in the beginning of a shell 
filling the shape is prolate and at the end of the shell filling it is oblate. Also, 
it explaines the odd-even staggering observed in the ionization potential of 
small alkali metal clusters. The simple explanation of the deformation in 
the first p-shell is the following: if only one or two electrons are in the p- 
shell they can occupy the same, say, Px state along the elongated axis of 
the prolate ellipsoid. The 4th asd 5th electron occupy the p^-state and the 
cluster prefers an oblate shape. The analysis becomes more complicated 
in filling higher angular momentum shells but, nevertheless, it is the single 
particle picture which explains the deformation also in large clusters [53,54]. 

The deformed jellium model can be improved by relaxing the require- 
ment of constant background density. This makes also the numerical com- 
putation of the Coulomb energy and potential easier. In the ultimate limit 
the lowest energy is obtained by allowing the background density at ev- 
ery point to be the same as the electron density [55,56]. In this so-called 
ultimate jellium model the background density of the bulk has the value 
corresponding to the minimum energy of the homogeneous electron gas. 
The cluster shapes are entirely determined by the electrons. The results 
for small clusters are shown in Figure 4. The shapes are more complicated 
than just ellipsoids. Nevertheless, it is important that these shapes are not 
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Fig. 4. Ground state shapes of clusters with 2 to 22 electrons (with increasing 
order) calculated with the ultimate jellium model. A constant density contour 
corresponding to about 40% of the bulk density is shown. 

at all sensitive to how the exchange-correlation potential is approximatied 
in the Kohn-Sham theory. In fact, essentially the same shapes are obtained 
if parametrization for nuclear matter is used [57]. 

Hakkinen et al. [58] have studied the universality of the shapes of fermion 
clusters in the local density approximation of the density functional theory. 
Indeed, for small cluster the multipole expansion of the shape shows that the 
shape parameters are nearly independent of the interparticle interactions. 
This allows accurate calculation of internal nuclear quadrupole moments, 
eventhough the equation of state of the nuclear matter is not well known. 
The results are shown in Figure 5. It should be noted that the simple model 
only applies for nuclei with the same number of protons and neutrons. 
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Fig. 5. The internal quadrupole moments of nuclei shown as a deformation pa- 
rameter 020 (see [56]) as a function of the number of protons. The experimental 
results are shown a stars. The results of the universal model are shown as black 
dots connected with a solid line. The other lines show results of other models. 
From [73]. 



The reduction of the total energy due to the defromation is quite dra- 
matic. In Figure 6 we show the total energy per electron of the ultimate 
jellium model as compared to the spherical jellium model for sodium. The 
energy does not any more increase in between the magic numbers and there 
is a strong odd-even staggering. It is also interesting to note that the total 
energy of the ultimate jellium model is not very different from the conven- 
tional jellium model for the same density (the bulk density of the ultimate 
jellium model is rg = 4.2 while the spherical jellium result in Fig. 6 is for 
rg=4.0). 

The key quantity for determining the shape in the single particle picture 
is the sum of the single particle eigenvalues. The eigenvalues depend on the 
shape more sensitively than the integrated parts of the total energy func- 
tional of the density functional theory [61]. The coupling of the eigenvalue 
spectrum to the shape was first utilized in the Nilsson model [59] for nu- 
clei (as explained by Mottelson in the 1957 Les Houches lectures [60]) and 
with the same model by Clemenger [53] for clusters. The Nilsson model is 
based on the harmonic oscillator where the energy eigenvalues have a sim- 
ple analytic expression. To demonstrate that the shape is insensitive to the 
details of the single particle potential we write the following expression for 
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number of electrons 



Fig. 6. Energy per electron for the ultimate jellium model (solid line) compared 
to the spherical jellium model with LDA (dotted line) and LSD A (dashed line). 



the eigenvalues [61] 

e(ni, U2, ns) = uinj + ^2^2 + wsUg + ( 7/2 — l)(wi + W2 + W3), (6.1) 

where rii are positive integers (zero not included) and 7 is a parameter 
which interpolates the eigenvalues between the harmonic oscillator 7=1 
and a rectangular box 7 = 2 . We use a volume conservation wiu;2W3 = Wq 
and assume an “axial” deformation oji = A deformation constant 8 can 
then be defined as 

W3 = Wo(l - i8)^ UJI=UJ 2 = 

Here the exponent 7 guarantees the correct single particle energy and the 
second equation follows from the volume conservation. The coefficient ^ = 
2(0.51 + 0 . 497)73 is a purely goemetrical factor introduced to take into 
account the (5-dependence of the moment of inertia. In the case of the 
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Fig. 7. Deformation as a function of the parameter 7 of the single particle spec- 
trum. 7 = 1 corresponds to a harmonic oscillator and 7 = 2 a rectangular box. 
Different lines correspond to the electron numbers indicated. 



harmonic oscillator the deformation parameter is the same as frequently 
used in nuclear physics [62]. 

Figure 7 shows the dependence of the optimal deformation parameter 
on the single particle spectrum, i.t. parameter 7, for even cluster from 4 to 
18 particles. In the cases of 8 and 20 particles the cluster is not deformed and 
6 = 0. We assume that each level can occupy two electrons due to the spin 
and that the total energy is the sum of the occupied single particle levels. We 
can see that the 4 electron cluster is prolate i5 > 0 and the 6 electron cluster 
is oblate <5 < 0 for all values of 7. In filling the d-shell the situation becomes 
more complicated: the clusters can flip between the prolate and oblate 
isomers when 7 is increased. This means the there are two isomers which 
are quite close in energy. This situation is known from the Nilsson model. 
However, it is more important to notice that the deformation parameter 
changes only slowly as a function of the parameter 7 which describes the 
single particle spectrum. This is the key for understanding the insensitivity 
of the shapes of small clusters on the details of the system in question. 
Finally, we whould mention that the Nilsson model can be extended to a 
quantitative model by the Strutinsky shell correction method [63], which 
has been suggesfully used for metal clusters [54, 64, 65] . 
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7 Tetrahedral and triangular shapes 

Two-dimensional triangular and three-dimensional tetrahedral clustes form 
a special class of deformed clusters. Electrons confined in a triangular cavity 
have a simple shell structure that can be solved analytically [66] . Similarly 
the periodic orbit theory shows that there are only two important periodic 
orbits in a triangle. The regular shell structure is accompanied with a 
regular supershell structure [67]. The shell closings are the same as in the 
2D harmonic oscillator. In the case of the 3D tetrahedron, the situation is 
not so simple: an analytic solution of the energy levels (in a cavity) do not 
exists and there are no nontrivial periodic orbits. However, the first shell 
fillings happen at the same electron numbers as in a 3D harmonic oscillator, 
i.e. 8, 20, 40, 70, 112 [45]. 

An octupole deformation will transform a sphere into a tetrahedron. 
Hamamoto et al. [68] have shown that a single-particle picture, related to 
that of the Nilsson model, indeed will lead to an octupole deformation with 
particle numbers 40, 70 and 112. This has been confirmed with the ultimate 
jellium model [45], which also shows that the tetrahedral deformation has 
an important role in stabilizing 40 as a strong magic number in the jellium 
model. 

The triangular and tetrahedral clusters have an interesting possibility 
of making “double magic” clusters. Complete tetrahedra are closed packed 
FCC structures having 4, 10, 20, 35, 56 etc. atoms [69]. When multiplied 
by two these numbers equal to the shell closings in tetrahedral cavity. A 
tetrahedron consisting of divalent metal atoms has then a closed shell of 
atoms and a closed shell of electrons. Ab initio electronic structure cal- 
culations show that indeed a 35 atom magnesium tetrahedron has a clear 
electronic shell structure [45] . Similarly, in two-dimensional clusters the tri- 
angles of divalent atoms would be “doubly magic”. Nevertheless, the sharp 
corners of tetrahedra and triangles are likely to make them energetically less 
favourable than more rounded geometries. The existence of double magic 
triangular clusters is still uncertain. 

8 Odd— even staggering in metal clusters 

The ionization potential of alkali metal clusters as a function of the number 
of atoms show a clear odd-even staggering. The ionization potential of an 
even cluster is larger than that of the neighbouring odd clusters [50] . Sim- 
ilar odd-even staggering is also seen in the abundances of clusters in the 
mass spectra and in the measured binding energies. In the case of small 
clusters ab initio calculations of quantum chemistry [3] give a quantitative 
description and explanation to the observed odd-even staggering. Neverthe- 
less, the similarity of the odd-even staggering between different metals and 
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its similarity to that observed in nuclear physics calls for a simple model for 
a qualitative explanation. 

According to the previous section the deformation tries to remove the 
degeneracy of the many-body state arising of a open electronic shell. The 
reason for the odd-even staggering is the spin-degeneracy, which the defor- 
mation can not remove. Simple models, like the Clemenger-Nilson model, 
already explains the odd-even staggering [53] . In the limit of small clusters 
the odd-even staggering shows similar universality as the deformation [58]: 
it becomes nearly independent of the system in question and the odd-even 
staggering is quantitatively similar (in properly scaled energy units [58]) in 
clusters and in small nuclei. 

In atomic clusters the detailed atomic geometry causes that the odd- 
even staggering is not as smooth function of the particle number as in the 
case of nuclei. However, at high temperatures where the clusters will sample 
a large collection of isomers, the average odd-even staggering of the Fermi 
energy will be smoother. The effect of the temperature on the odd-even 
staggering has been studied in [70] using several simple models. 

Qunatitatively the odd-even staggering of the total energy can be deter- 
mined as the absolute value of the second derivative of the total energy as a 
function of the number of electrons. Figure 8 shows the odd-even staggering 
calculated using the ultimate jellium model with the parameters suitable for 
nuclear matter. The figure also shows the experimental results for nuclei. 
For nuclei with less than about 20 protons and neutrons the deformation 
counts most of the odd-even staggering while in larger clusters the experi- 
mental staggering is much stronger. Also, in nuclei the odd-even staggering 
is a much smoother function of the particle number than in the results of 
the ultimate jellium model. 

Figure 8 also show the experimental results for atomic clusters [71]. At 
smallest sizes the clusters show nearly as strong staggering as the nuclei, 
but the staggering gets weaker when the cluster size increases. Part of this 
rapic decrese might be an effect of the finite temperature of the clusters. 

Finally, it should be noted that in nuclear physics the pairing of nucleons 
in accordance with the BCS-theory [62] seems to be an important element 
in the observed odd-even staggering. This is especially the case in large 
nuclei. From Figure 8 we can see that when the nucleus has more than 
about 20 protons (and neutrons) the deformation can account in average 
only about half of the observed odd-even staggering of the total energy. In 
the case of metal clusters the binding energies of Cooper pairs are orders of 
magnitude smaller than the observed odd-even staggering in the ionization 
potential. Consequently, in metal clusters the odd-even staggering is solely 
a deformation effect. 
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N|E(N + 1)-2E(N) + E(N-1)| 




Fig. 8. Odd-even staggering of the total energy. Experimental results for nuclei 
are shown as stars, the results of the ultimate jellium model are shown as black 
dots. The experimental results for atomic clusters are shown as plus-signs. The 
open circles show theoretical results of from [72]. N is the number of protons and 
neutrons in the nucleus or the number of electrons in the case of atomic clusters. 



9 Ab initio electronic structure: Shape and photoabsorption 

Above we have considered clusters mainly as islands of electrons and the 
ions are used only to provide a smooth background to confine the elec- 
trons. The treatment of the true ionic sceleton together with the electrons 
can be done either using a perturbation theory or so-called ab initio com- 
putations. The ab initio calculations can be made with several different 
degrees of approximations. The traditional physical chemistry approach is 
to use Hatree-Fock or Configuration Interaction (Cl) methods. The latter 
approach has the advantage of being essentially exact for small electron 
numbers and of providing the electronic excitation spectrum which often 
can be directly compared with experiments [3] . 

Another approach is to use the density functional Kohn-Sham method 
together with proper pseudopotentials. Here the advantage is that large 
electron or atom numbers, up to thousands, can be treated. In any ab initio 
method the atomic positions can be determined by minimizing the total 
energy. The Born-Oppenheimer approximation allows us to use classical 
molecular dynamics and, for example, to do simulated annealing for looking 
the ground state geometry. This method is often called ab initio molecular 
dynamics or “quantum molecular dynamics” since the forces are determined 
by solving the electronic structure. This can be done either adiabatically [74] 
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Fig. 9. Comparison of measured (solid lines) and calculated (dashed lines) pho- 
toemission spectra of Al^g at three different temperatures. From reference [46]. 



or by letting the electron wave functions also to be as dynamical variables 
as done in the Car-Parrinello method [75]. 

As an example we will study the photoionization of aluminium clusters. 
In this method a laser is used to ionize the cluster and the kinetic energy of 
the outcoming electron is measured. The kinetic energy reflects the binding 
energy of the electron in question and, in the single-electron picture, the 
density of electron levels can be measured. In practice, if mono-size clusters 
are studied the measurement has to be made for cluster ions or anions. 

The Koopmans’ theorem for Hartree-Fock approximation suggests that 
the kinetic energy of the photoelectron could be approximated by the single 
particle eigenvalue of the electron. In the case of the Kohn-Sham method 
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Fig. 10. Photoemission spectra of Aljg isomers. Solid lines are calculated spectra 
for isomers shown. The dashed line is the experimental spectrum. The lowest 
energy isomer gives the best agreement with the experiment. From reference [47]. 

it can be shown that the Koopmans’ theorem does not give the correct 
ionization potential, but the single particle eigenvalue has to be corrected 
by the asymptotic limit of the exchange-correlation potential [46] . Assuming 
that the same is true also for the detachment of electrons from the deeper 
single particle levels, we can approximate that the photoelectron energy as 

£lkin = + Ci - (9.1) 

where hv is the photon energy, the energy energy eigenvalue of the electron 
in the cluster, and the asumptotic limit of the exchange-correlation 
potential. Model calculations have shown that is nearly independent of 
the energy eigenvalue in question and can be approximated by a constant 
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[47]. Assuming further that the photoemission propability is equal for all 
electrons, the photoemission spectrum gives the density of states of the 
cluster. At zero temperature this consists of discrete lines. At a finite 
temperature the ionic motion moves the single particle levels up and down 
giving a width to the peaks. Moreover, at high temperatures the cluster 
will sample several isomers and the density of states will be an average over 
the isomers. 

Ab initio molecular dynamics provises a tool to average the density of 
states at a desired temperature. Figure 9 shows the measured photoemission 
spectrum for Al ]”3 at three different temperatures compared to calculated 
densities of states. Clearly this type of experiments and computations can 
give information of the cluster temperature. 

The electronic structure of the cluster depends on the ionic positions. 
Consequently, the photoemission spectra measured at low temperatures to- 
gether with the computed density of states can be used to determine the 
cluster goemetries. Figure 10 shows as an example the photoemission spec- 
trum of Al]”g compared to calculated spectra for different geometries shown 
in the figure. In this case only the isomer corresponding to the lowest total 
energy gives a good agreement with the measured spectrum. This gives 
confidence that properties of small clusters can be rather reliably computed 
using the Kohn-Sham method and ab initio molecular dynamics. 



10 Quantum dots: Hund’s rule and spin-density waves 

In semiconductor heterostructures the conduction electron gas can be con- 
fined to move in a two-dimensional (2D) layer. There are several ways to 
further confine a finite number of electrons to form a quantum dot [76,77]. 
Typically, in a quantum dot the electrons move in a nearly harmonic poten- 
tial which can have a circular or elliptic symmetry. The electronic properties 
of the quantum dot are most often measured by studying the electron trans- 
port through the dot. We will not go into the details how the quantum dot 
is manufactured or the measurements done, but concentrate on the study of 
the interesting quantum mechanics of a small number of electrons in a 2D 
harmonic oscillator. However, it should be emphasized that the electrons 
in question are the conduction electrons of the semiconductor. They are 
characterized by a small effective mass and very low density. An external 
magnetic field can fully polarize the electron gas and lead to new many-body 
effects as most clearly manifested in the quantum Hall effect. 

Noninteracting electrons in a quantum dot have a shell structure of the 
2D harmonic oscillator. The shell fillings (magic numbers) occur at elec- 
tron numbers 2, 6, 12, 20 etc. The electron-electron interaction will reduce 
the large degeneracies of the harmonic oscillator potential due to the fact 
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that the effective single particle potential is not harmonic. The situation is 
similar to the spherical jellium model for clusters: the electrons move in a 
harmonic well, but the effective mean field potential is not harmonic. Nev- 
ertheless, the harmonic confinement means that, like in metal clusters, a 
collective plasmon excitation corresponding to the center of mass motion is 
the dominating dipole excitation. Again, the density functional Kohn-Sham 
formalism can be used to study the electronic structure. The only difference 
from the jellium model is that in the case of quantum dots the system is 
two-dimensional and consequently in the local density approximation the 
exchange-correlation potential is different from that of the 3D case [78]. 




Fig. 11. Effect of the deformation on the second derivative of the total energy in 
ellipsoidal quantum dots (lower panel). The dashed line shows the result for the 
circular dots with magic numbers 2, 6, 12 and 20. Note the maxima at 4 and 9 
caused by the Hund’s rule. The solid line shows the result for an ellipsoidal dot 
and dashed line for a quasi-one-dimensional wire. The deformation reduces the 
shell structure as in metal clusters. The upper panel shows the total spin of the 
ellipsoidal dot. (In spherical dots the spin is determined by the Hund’s rule, being 
zero at magic numbers and at maximum at half- filled shells). 
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In the case of an open shell quantum dot the spin-density functional 
theory should be used to allow the minimization of the total energy with 
respect to the total spin. If this is done the shell fillings obey the Hund’s first 
rule in agreement with the experimental findings [79]. Figure 11 shows the 
second derivative of the total energy as a function of the electron number in 
a circular and deformed quantum dots. The magic numbers corresponding 
the full shells are clearly seen as well as the maxima corresponding to the 
half- filled shells at electron numbers 4, 9 and 16. The Hund’s rule is caused 
by the breaking the spin-symmetry which separates the spin-up and spin- 
down energy levels. In the case of a half- filled shell this opens a gap at 
the Fermi level and reduces the total energy in a similar fashion as in the 
spherical jellium model for metal clusters. In reality the metal clusters can 
spontaneously deform and the Hund’s rule does not have any role, while 
in quantum dots the shape is fixed and the Hund’s rule can be observed 
experimentally. 

In a large quantum dot there is, however, another mechanism to increase 
the gap at the Fermi level. This is a static spin-density wave: the total spin 
is zero but the spin-polarization is an oscillating function. The LSDA cal- 
culations show that for example the quantum dots with 24 and 34 electrons 
have a spin-density wave in their ground states when the electron density 
is low enough [80]. 

The spin-density waves are even more clearly seen in the quasi-one- 
dimensional systems such as quantum wires and quantum rings. In an 
infinite one-dimensional wire it can be understood as a manifestation of the 
spin-Peierls transition [81]. In a ID metal the Fermi surface consists of only 
two points. Thus any periodic potential with the periodicity determined 
by the Fermi wave length will open a gap at the Fermi surface and reduce 
the total energy. The spin-density wave will provide a periodic potential 
(opposite phase for spin-up and spin-down electrons) which opens the gap 
[82]. 

Figure 12 shows the spin-density wave for a ring of 12 electrons calculated 
using the local density approximation. The advantage of the local density 
approximation is that the existence and origin of the spin-density wave is 
easy to determine. The Kohn-Sham theory as a mean field theory can 
brake the symmetry of the system and reflect the internal symmetry of the 
many-body state. 

In principle, in circular systems the symmetry of the Hamiltonian re- 
quires that the total electron density and spin-density have also the circular 
symmetry. The existence of the spin-density wave can then be seen only in 
the internal structure of the many-body wave function. This can be done 
by studying correlation functions. However, since the Pauli exclusion prin- 
ciple already dictates certain correlation between spin-up and spin-down 
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Fig. 12. The total electron density and the spin-up electron density in a narrow 
ring of 12 electrons calculated using the LSDA approximation. The spin-down 
density is in the opposite phase than the spin-up density. Together they form an 
antiferromagnetic ring of electrons. 



electrons, it is not easy to see the onset of a spin-density wave state by 
lookling, for example, at the pair correlation function. 

In the case of quasi- ID-rings the existence of the internal spin-density 
wave can be most easily seen by studying the excitation spectum of the 
many-body state [83]. Figure 13 shows the rotational spectrum of a six- 
electron ring calculated using an essentially exact Cl method. The spectrum 
is characterized by a low-energy band which has a six-fold symmetry: the 
energy differencies and spin-assignments of the M = 0 and M = 6 are 
equal, similarly M = 1 and M = 2 states are similar to those with M = 5 
and M = A. This spectrum can be explained quantitatively with a model 
Hamiltonian 

H = + + (10.1) 

ij V 

where the first term is an antiferromagnetic Heisenberg Hamiltonian [5] 
(with coupling constant J), the second term describes rigid rotations of a 
ring of electrons (with moment of inertia I), and the last term describes 
vibrations of localized electrons. 

The lowest band of Figure 13 corresponds to the vibrational ground 
state. The spin-structure of the band is determined by the Heisenberg 
model and the overall rise with the angular momentum comes from the 
rigid rotations. This result shows that the electrons in a quasi-lD-ring are 
localized in an antiferromagnetic chain. Consequently, the internal struc- 
ture indeed has the spin-density wave suggested by the simple local density 
approximation (Fig. 12). This example shows that the study of the inter- 
nal structure of the complicated many-body wave function can indeed be 
cumbersome and the simple models often give a more transparent picture of 
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Fig. 13. Exact many-body spectrum of a quantum ring with six electrons. The 
numbers indicate the total spins of the lowest states. 



the key properties of the system. Note that it is the structure of the model 
Hamiltonian that tells us that the system is an antiferromagnetic chain of 
localized electrons. By solving the density or the spin-density of coming 
from the model Hamiltonian does not leed to anything since both are circu- 
larly symmetric. The pair-correlation function does not tell us much either, 
since it is known that the correlation in a ID Heisenberg model decays as 
1/r, r being the distance between the spins. 

11 Deformation in quantum dots 

The main difference in studying the deformation of metal clusters and quan- 
tum dots is that while in the metal clusters the shape is determined by the 
electronic structure due to the Jahn-Teller effect in quantum dots the shape 
is determined by the external confinement caused by external electrodes or 
the actual shape of the dot structure. In quantum dots the electronic struc- 
ture can then in principle be freely chosen. Nevertheless, the external po- 
tential in a quantum dot is always nearly harmonic, at least at the bottom 
of the well. In the case of a small number of electrons the most important 
deformation is then ellipsoidal. The simple Nilsson diagram [59] can be used 
to study the effect of deformation to the shell structure. Since the deforma- 
tion redistributes the energy levels and opens gaps at different places than in 
the circular case, the magnetism (Hund’s rule) and magic numbers depend 
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on the deformation, as illustrated in Figure 11. For example, the magnetic 
moment of the quantum dot can be changed by deformation caused by ex- 
ternal electrodes. Reimann et al. [84] have studied in detail the effect of the 
deformation on the magnetism in quantum dots. 

In layered semiconductors it is possible to construct a structure where 
an electron quantum dot is on top a quantum dot of holes. In this case the 
electrons and holes can bind each other without external confinement. The 
shape is then free to deform like in the case of metal clusters. The structure 
of the electron and hole droplets will depend on the distance between the 
layers and on the number of particles. If the distance between the electron 
and hole layers is small compared to the dot size we can approximate that 
the electrons and holes are in the same layer in calculating the electronic 
structure. The effect of the separating layer is then only in preventing 
recombination. Futhremore, if the number of electrons and holes is the 
same and they have the same effective mass, we end up in a model which is 
a two-dimensional version of the ultimate jellium model described earlier. 
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Fig. 14. Contours of the ground state densities of 2D electron-hole clusters calcu- 
lated using the ultimate jellium model. The right column has the magic clusters. 
From reference [85]. 
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In the ultimate model the electron and hole densities (which are equal) 
are determined by the properties of the 2D electron gas: There is no aver- 
age Coulomb interaction and the cluster is free to deform. Figure 14 shows 
the optimal shapes of such a model [85]. It is seen that the magic clusters 
have triangular shapes in addition to the spherical ones. This is related 
to the tendency of octupole deformation of some magic clusters in 3D case 
discussed earlier. Moreover, the freely deformed shapes in 2D seem to have 
a much richer variety of shapes than in the 3D case. In this model the quan- 
tum dot is always nonmagnetic, since the deformation can more effectively 
open a gap at the Fermi level than the magnetization. The situation is the 
same as in the 3D case. 

Using the Kohn-Sham method it is straighforward to improve the model 
of layered electron/hole dots. The actual distance between the dots as well 
as the interdot Coulomb potential can be taken into account, as well as 
the necessary external confinement. However, in the case that the external 
confinement is weak in comparison to the interdot Coulomb potential the 
spontaneous deformation of the electron and hole clouds in the quantum 
dot should be possible. In principle the same should happen in 3D droplets 
of electron-hole plasma [55] . 



12 Localization of electrons in a strong magnetic field 



The external magnetic field is described in quantum mechanics by a vector 
potential (B = V x A) which is added to the momemtum operator (min- 
imal substitution). In the density functional Kohn-Sham formalism part 
of the kinetic energy of the interacting electrons is hidden in the exchange- 
correlation energy. Consequently, it is not enough to include the magnetic 
field contribution only to the kinetic energy operator of the single parti- 
cle Hamiltonian, but also in the effective potential. This leads to so-called 
current-density functional formalism developed in detail by Vignale and 
Rasolt [86]. 

The current-density-functional method is rather complicated due to the 
diffucult coupling between the exchange-correlation energy and the vector 
potential. We do not desctribe the formalism here (see Refs. [86,87]), but 
only state that the single particle Kohn-Sham equation now is 



— (-fW- 
2m ^ 



A(r))VKff(r) 



‘4^ia — 



( 12 . 1 ) 



where vector potential includes a contribution coming from the electron 
currents and the effective potential depends on the vector potential. Ves 
also includes the direct interaction of the external magnetic field with the 
electron spin, so-called Zeeman term. As in the normal Kohn-Sham method 
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the equations can be solved iteratively. The results will give the total spin, 
electron and spin densities and the internal currents. 

The actual numerical solution of the equations is quite demanding. How- 
ever, some general features can be understood by looking at the noninteract- 
ing electrons in a 2D harmonic oscillator in the presence of a perpendicular 
magnetic field. This problem can be solved analytically and the single par- 
ticle levels form so-called Darwin-Fock basis and energy levels 

CnL = ( 2 n -|- 1 -l- \L\)h^ 4 "^^ ~ '^LTiuJc + g* (X bSzB , (12.2) 

where L is the orbital angular momentum and lOc and ojq the cyclotron 
and confinement frequencies, respectively. The last term takes into account 
the Zeeman splitting (g* is a reduced Lande factor of the semiconductor). 
The cyclotron frequency is proportional to the magnetic field. In a high 
magnetic field only the lowest Landau band is filled (n = 0) and each 
harmonic oscillator state will have only one electron corresponding to the 
maximum orbital angular momentum. The electron density distribution will 
be homogeneous, so-called maximum density droplet. This happens also in 
the case of the interacting electrons in the quantum dot. The only difference 
is that in this case the single particle states are not exactly the Darwin-Fock 
states but solutions of the circular problem with a more complicated effective 
potential. 

The Zeeman splitting can be so small that a maximum density droplet 
is first formed with both spin states equally populated. When the field 
increases the droplet with spin-up states starts to shrink and that of the 
spin-down states expand until all electrons populate the spin-down states. 
Up to this point the interacting electron system behaves quite similarly with 
that of the noninteracting electrons, the only differences being in the order 
the single particle levels are filled before the maximum density droplet state 
is reached. 

The electron-electron interactions change the situation drastically when 
the magnetic field is further increased. In the noninteracting case the maxi- 
mum density droplet continues becoming smaller in size. In the interacting 
case the Coulomb repulsion prevents the shrinking of the droplet forcing 
electrons from small angular momentum values to move to higher angular 
momentum values. In the density this is seen as a ring of electrons sep- 
arating from the maximum density droplet [88]. The current-spin-density 
formalism gives a result that this ring consists of localized electrons as shown 
in Figure 15. Each maxima at the ring corresponds to a single current vortex 
consisting of one electron. Increasing the magnetic field eventually makes 
all the electrons to localize into single “vortices” . 

The localization of electrons in the density functional model is an other 
example of an internal symmetry breaking of the system. Again, as in the 
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N=20 




Fig. 15. Electron density in a circular quantum dot in a high magnetic field. The 
Maximum density droplet is shown in the left. Increasing the field separates a 
ring of electrons as shown in the right. 



cases of the deformation of clusters or spin-density waves in quantum dots, 
the symmetry of the Hamiltonian requires that in the laboratory frame of 
reference the total density should have a circular symmetry. The density 
functional theory, as a mean field theory, can reduce the energy by breaking 
the symmetry of the effective single-particel potential. Similar localiza- 
tion of electrons in high magnetic fields is obtained also in the unrestricted 
Hartree-Fock calculations [89]. 

Exact many-body calculations, like Cl, naturally give a symmetric total 
electron density, and correlation functions have to be studied to see the in- 
ternal localization of electrons [90]. Another possibility would be to study 
the excitation spectrum of the many-body states as done above in the case 
of a quantum ring where the antiferromagnetic Heisenberg model explained 
the electron localization. In the presense of a magnetic field, however, the 
computations are less accurate and the analysis of the rotational spectra 
might not be as simple. However, there is another way to reveal the elec- 
tron localization. By reducing the symmetry of the external confinement 
from the circular symmetry, the symmetry broken solutions become allowed. 
Koskinen et al. [91] have performed “exact” Cl calculations for ellipsoidal 
quantum dots in a high magnetic field and indeed obtained the localization 
of electrons. The results are shown in Figure 16. 

The above discussion and results show that the local current-density ap- 
proximation can quite well describe the electrons in a magnetic field and 
give information of the internal structure of the many-body wave function 
which is not always so easy to see from the exact result (even if it is avail- 
able) . The basic reason for this is that the system is dominated by the single 
(quasi) particle picture. This should brake down in even stronger magnetic 
fields where the fractional quantum Hall regime is reached. In this case 
the many-body wave function can be approximated by the Laughlin wave 
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Fig. 16. Exact electron densities in ellipsoidal quantum dots containing six elec- 
trons. The upper panel shows the maximum density droplet and the lower panel 
the localization of electrons in a high magnetic field. 

function [92] 




where z = x + iy and the x and y-coordinates are in the units of the mag- 
netic length. The exponent q has to be odd (for fermions). The fractional 
quantum Hall regime is reached when q = 2>. 

It is interesting to note that for q = 1 the Laughlin wave function is just 
the maximum density droplet (for noninteracting particles), i.e. a Slater 
determinant of single particle states. For g = 3 it is a rather complicated 
superposition of Slater determinants made of single-particle states of the 
lowest Landau band. It seems unlikely that the local density approximation 
could describe the Laughlin state properly. Nevertheless, suggestions has 
been made how to include the fractional quantum Hall regime into the 
exchange-correlation functional. 
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13 Conclusions 

In this chapter we have tried to illustrate that simple single-particle mod- 
els can give good understanding of many features of finite electron systems 
like metal clusters and quantum dots. The underlaying theory is the den- 
sity functional Kohn-Sham method which formally folds the many-body 
problem into a single particle problem. 

In the Kohn-Sham formalism all the quasi-particles move in the same 
effective potential. Due to the self-consistency of this potential the method 
allows a spontaneus symmetry breaking, which is sometimes viewed only as 
an artifact of the model. Nevertheless, the symmetry breaking can reflect 
the internal structure of the many-body state and in many cases gives a more 
transparent picture of important properties, like shape deformation, spin- 
density waves and electron localization, than the exact many-body wave 
function. 

The many-body physics of small qauntum systems is rich. There is still 
need for exact many-body calculations, approximative methods as well as 
for simple models. 

The author wihes to thank Stephanie Reimann, Matti Koskinen, Ben Mottelson, Pertti 
Lipas and Jere Kolehmainen for many helpful and exciting discussions. This work has 
been supported by the Academy of Finland under the Finnish Centre of Excellence 
Programme 2000-2005 (Project No. 44875, Nuclear and Condensed Matter Programme 
at JYFL). 
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THEORY OF CLUSTER MAGNETISM 



G.M. Pastor 



1 Introduction 

Magnetism is a major area in fundamental and applied research, a subject of 
main concern in human knowledge since the antiquity, which remains, even 
nowadays, quite elusive. The principal sources of magnetism in atoms, clus- 
ters and solids are the currents associated to the electronic motion (orbital 
magnetism) and the intrinsic angular momentum of the electrons (spin mag- 
netism). These properties are of quantum mechanical nature and cannot 
be regarded as the superposition of independent individual-electron con- 
tributions. They result from the behavior of the ensemble of electrons in 
the relevant low-energy scale (valence electrons). Consequently, the mag- 
netic properties of matter are very sensitive to the details of the electronic 
correlations and to temperature. 

Finite clusters constitute a new state of matter with its own fascinating 
phenomena. From this point of view, the properties of atoms and solids may 
be regarded as limiting cases of a much richer and complex dependence on 
cluster size. One of the main goals of the theory of metal clusters is to 
understand how the properties of localized, atomic electrons change as they 
become part of a finite cluster and start to delocalize, and how bulk-like 
behavior is attained. The study of magnetism constitutes one of its major 
challenges. In isolated atoms almost all elements show a non- vanishing 
magnetic moment given by Hund’s rules, while in the solid state only a few of 
them (some transition metals of the Fe group, the lanthanides and actinides) 
preserve a non-vanishing magnetization. The latter is a consequence of 
the delocalization of the valence electrons which favors equal occupation 
of states having opposite projections of spin and orbital moments. In this 
context the magnetism of transition-metal (TM) clusters is of fundamental 
interest, since atomic and bulk behaviors are intrinsically different. Atomic 
magnetism is due to electrons which occupy localized orbitals, while in TM 
solids the electrons responsible for magnetism are itinerant, conducting d- 
electrons. Therefore, the magnetic properties of TM’s are very sensitive to 

© EDP Sciences, Springer- Verlag 2001 
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system size and to the local environment of the atoms. Clusters of these 
elements show a wide variety of very interesting magnetic phenomena. 

The purpose of these lecture notes is to introduce the main concepts in 
the theory of cluster magnetism giving emphasis to the properties of tran- 
sition metals. To get started, the next section recalls the basic physical 
pictures of localized (atomic) and itinerant (band) magnetism. In Section 3 
representative experimental results on free and deposited transition-metal 
clusters are briefly reviewed. Sections 4-6 present various theoretical meth- 
ods to determine the electronic and magnetic properties of clusters as well 
as the main results derived from them. In Section 4 the size and envi- 
ronment dependence of T = 0 properties, such as spin moments, orbital 
moments, magnetic anisotropy and electronic structure, are investigated in 
the framework of a mean-field approach. Electron-correlation effects are 
analyzed explicitly in Section 5. Section 6 is focused on temperature depen- 
dent properties - average magnetization per atom, magnetic susceptibility, 
specific heat, etc. - and on their relation to the many-body excitation- 
spectrum. Finally, we conclude in Section 7 by pointing out some future 
research directions. 



2 Background on atomic and solid-state properties 



2.1 Localized electron magnetism 

In the presence of an external magnetic held = V x A(V • A = 0) the 
Hamiltonian of a non relativistic electron is modified in two major ways. 
First, as in classical mechanics, the linear momentum p is replaced by p -I- 
{ejc)A (e > 0). The kinetic energy can then be written as 



In eh ^ 

T = - — p H H ■ 

2m me 



{H X r)'^ 

2mc^ 4 



( 2 . 1 ) 



where i = f x p is the orbital angular momentum operator. Second, since 
the electron has an intrinsic angular momentum, we must take into account 
its direct interaction with the magnetic held given by 



ATYgpin — 9o Mb s • i7. 



( 2 . 2 ) 



where s = (f/2 is the spin operator, /xb = eh/2mc the Bohr magneton and 
go = 2.0023 ~ 2 the electronic g-factor. 

For a many-electron system (atom or solid) the Hamiltonian reads 

Ti = 7io + pbH ■ {L + go^) + ^ ] {H x r^)^. 



(2.3) 
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L = £i refers to the total angular momentum, S = Si to the to- 
tal spin, and Tig to the Hamiltonian in the absence of magnetic field. 
The last term in equation (2.3) is a repulsive potential that gives a dia- 
magnetic contribution to the magnetic susceptibility x [!]• The second 
term, the Zeeman interaction, gives a paramagnetic contribution to y that 
dominates over the diamagnetic term as soon as ( L -I- 25 ) yf 0. This is the 
case of almost all open-shell systems we are most interested in, e.g., TM or 
rare-earth atoms or ions. Notice that both orbital and spin contributions 
add up to form /iat = — /xb(T -I- 25) which can be regarded as the atomic 
intrinsic magnetic moment. Consequently, the low-lying eigenvalues of L, 
5 and the total angular momentum J = L + S are of central importance to 
the magnetic response. We shall therefore recall Hund’s rules which sum- 
marize the main results of the elementary theory of atomic multiplets [2]. 
These properties are important for understanding the sources of localized 
magnetism in isolated atoms or ions diluted in insulators, and also as a 
reference for comparison with the behavior of itinerant-electron magnetism 
in solids and as a function of cluster size. 



2.1.1 Magnetic configurations of atoms: Hund’s rules 

Consider an atom or ion with only one partially filled shell (for example, a 
TM or rare-earth atom) which is characterized by a given principal quantum 
number n and angular momentum £. Since closed-shell states are non de- 
generate and have L = S = J = 0 we need only to consider the electrons in 
the open shell. We have therefore 2{2£+ 1) one-electron states which can be 
characterized by the orbital momentum £ of the shell, the azimuthal quan- 
tum number m {—£ < m < £) and the spin a =t, i. Within the open shell 
the single-particle Hamiltonian of the electrons in a central field (ignoring 
electron-electron and spin-orbit interactions) is given by 



^0 = £-nS. cj^aCma- = Y -£<m<£, Cr=t,i (2.4) 

m,(7 m,(T 



where 






n(. 



= I Ktir) 



- V2 + Wore(r) 
2m 



Rne{r)r'^dr 



(2.5) 



is the average of kinetic and electron-core energies in the shell n£, and 
cJncr (cjncr) is the Creation (annihilation) operator for an electron of spin a 
at orbital Unim = Rnt.{r) Y£m{S,4>)- Since is independent of m and a, a 
shell having u = + i^i electrons would be (^t^^)“fold degenerate. 
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Part of this degeneracy is removed by the electron-electron interaction 



( 2 . 6 ) 



where 

V^jrt = f u*{fi)u*{r2) 1^ - I Mr(ri)Mt(r2)d^rid^r2 (2.7) 

J ri-r2| 

are the Coulomb integrals (z = nlm). The dominant Vijrt are those which in- 
volve only one pair of distinct electrons. Thus, one may simplify 
equation (2.6) by considering only the direct Coulomb integrals Umm' = 
Vmm'mm' that represent the electrostatic repulsion between 2 electrons oc- 
cupying orbitals m and m' , and the exchange integrals Jmm' = Vmm'm'm 
which are all positive [2] . Jmm' measures the energy difference between the 
singlet and triplet states that can be formed with two electrons occupying 
orbitals m and m'. Typical values for TM atoms are Umm' — 10 eV and 
Jmm' — 0.5— 1.0 eV. Keeping the terms (i = r, j = t) and {i = t, j = r), 
Hee takes the form 

n ee — 2 ^ ^ U (jf 

mm' 

(m<7)^(m' a-') 



2 ^ 

m^m' 






( 2 . 8 ) 



where fima = cln^Cma and s+ = cl^^Cmi [Sm = (s+)^]- = 

Umm' - Jmm' and u}^^, = U^^, = Umm' represent the effective Coulomb 
repulsion between electrons having parallel and unparallel spins. 

Hund’s first rule states that the ground-state configuration of open-shell 
atoms has the largest possible total spin S. From equation (2.8) one observes 
that it is energetically more favorable to set all the electrons in the open shell 
with the same spin due to the exchange interaction Jmm' ■ The ground-state 
spin S of an atom having v electrons in a partially filled shell is therefore 
S = vj2 for J/ < 2^ -h 1, and S' = 2£ -h 1 - Jz/2 for 2^ -h 1 < iz < 2(2£ -h 1). 
The orbital dependence of Umm' and Jmm' is not essential in this case, as 
can be easily verified by setting Umm' = U and Jmm' = J ■ However, if one 
assumes Jmm' = J and Umm' = U, an important degeneracy remains, since 
all states having maximal Sz = v^j2 < 2£-|- 1 and = 0) would 

have the same energy E{S = v^ /2) = {U — J)v^{i'^ — l)/2. 

Hund’s second rule states that the lowest-energy configuration among 
those having maximal S (1st rule) is the one with the largest total or- 
bital momentum L. This is a consequence of the orbital dependence of 
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Umm’ and Jmm'- In fact, the Coulomb repulsion energies among electrons 
with the largest possible m are weaker, since the corresponding electronic 
densities </')P are more distant in average [2]. For example, for d- 

electrons the deviations AUmm' of = Umm' — Jmm' from the average 
value are approximately A[/ 2 ,i = —58u, AU 2 ,o = —58u, AC/ 2,-1 = —4m, 
AC/ 2,-2 = 50 m, AC/ 1,0 = 23 m and AC/ 1,-1 = —31m, where m is a constant 
related to a radial Coulomb integral (u = F^/441 and ~ 49J [2]). The 
deviations of Umm' and Jmm' from the averages U and J are typically of 
the order of J/10 ~ 50—100 meV. Taking into account the 1st rule and 
the exclusion principle we have L = v{2l — m + l)/2 for < 2C + 1, and 
L = {v — 21 — 1)(4? — M + 2)/2 for 2i + 1 < v < 2(2£ + 1). For example, 
for V = 2 {v = 7) d electrons the lowest energy configuration has S' = 1 
(S = 3/2) and T = 3. 

Hund’s third rule states that the lowest-energy multiplet has a total an- 
gular momentum J = |L — S| for m < 2C -|- 1, and J = L -|- S for 1 / > 2C -|- 1. 
This is the result of spin-orbit interactions. In fact, the non-relativistic 
Hamiltonian commutes with L and S. Therefore, the (2S -I- 1)(2L -I- 1) 
ground-state degeneracy cannot be removed at the non-relativistic level. 
For not too heavy elements (TM’s and even lanthanides) L and S can still 
be considered as good quantum numbers. The major non-scalar relativistic 
effects are then taken into account by the spin-orbit Hamiltonian 

Hso = AL-S, (2.9) 

where A > 0 for 0 < < 2/ -|- 1 and H < 0 for 2C -|- 1 < m < 2(2/ -|- 1) [3]. 

Consequently, Hso yields antiparallel alignment of L and S for shells less 
than half filled (J = |T — S'!) and parallel alignment of L and S for shells 
more than half filled {J = L + S). 

2.1.2 Magnetic susceptibility of open-shell ions in insulators 

At low temperatures the free energy F = —kUT In Z of an ion in an insulator 
can be calculated by considering its 2J -|- 1 lowest-multiplet states (J 7 / 
0). From equation (2.3) the field dependence of the energy of \ JLSJz) is 
given by 

Ejlsj. = Eo(JLS) + fiBH{JLSJz\Lz + goSz\JLSJz) ■ 

Applying the Wigner-Eckardt theorem within the JLS multiplet we have 
{JLSJz\Lz + goSz\JLSJz) = g{JLS) {JLSJz\Jz\JLSJz), 



g{JLS) 



3 1 IS{S+1)-L{L+1) 

2 2 [_ J{J+1) 



where 
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is the gyromagnetic Lande factor {go = 2). The magnetic field splits the 
2 J+1 states uniformly by g{JLS) hb H . The magnetization is then given by 



M = - 



dF 



g/ZB J Bj 



/ gg,BJH \ 

\ kBT J’ 



( 2 . 10 ) 



where Bj{x) is the Brillouin function, and the zero-field magnetic suscepti- 
bility by 



d^F ^ gVI J(J+1) 
dm H=o 3 kBT 



( 2 . 11 ) 



This is Curie’s law which describes the magnetic response of impurities that 
keep a localized permanent moment, for example, rare-earth impurities in 
insulators. In order to compare with experiments it is customary to write 



= 2 P" 

^ fVa 3^®fcBT’ 



( 2 . 12 ) 



where p is the effective Bohr magneton number. From equation (2.11) one 
observes that the theoretical free-ion value of p is pth = g{JLS) yV(J+l). 

Extensive experimental studies show that in the case of rare-earth in 
insulators Curie’s law is followed very closely with J = |L±S'|, i.e., the 
effective moment is the result of the vector sum of spin and orbital contri- 
butions as in the free ion [1]. This is a consequence of the fact that the 4/ 
valence orbitals are very localized. 

In the case of TM impurities, Curie’s law is still followed very closely in 
what the 1 /T dependence is concerned, but the measured effective moments 
p deviate considerably from pth = g{JLS) \J J{J + 1) [1]. The d-electron 
states in TM’s are more delocalized and thus more affected by the atomic 
environment than the 4/ orbitals. In TM ions the experimental p no longer 
follows the free-ion total moment J = |L ± S'] but rather p ~ 2a/S^ 5+1), 
which corresponds to J ~ S'. One says that the orbital moment vanishes or 
that it is “quenched” under the action of the crystal-field. Still, remains 
a good quantum number, i.e., the 3d states preserve their identity. For TM 
ions the interaction with the atoms surrounding the ion are more important 
than the weaker spin-orbit energy. For the localized / states of the heavier 
rare-earths the opposite trend holds. The non-spherical potential created 
by the atoms surrounding the magnetic impurity, known as crystal field, 
induces transitions between the different m = ±1 and m = ±2 d orbitals, 
and may also favor energetically the occupation of the m = 0 orbital yielding 
(Lz) — 0. However, notice that in many cases the spin-orbit interactions 
still induce small orbital moments (Lz) which add up to the dominant spin 
moments. For example, one observes that Pexp is somewhat larger than 
Pth{J=S) for Fe, Co and Ni [1]. 
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2.1.3 Interaction between local moments: Heisenberg model 

Consider a system composed by N local moments, for example, a periodic 
array, a finite cluster of magnetic atoms, or an alloy with non- vanishing con- 
centration of magnetic impurities. As long the magnetic moments preserve 
their local nature, the quantum mechanical state of ion i is characterized by 
the vector operator Si, that can be the total moment or the spin operator 
depending on the role of L. The simplest, lowest-order rotational invariant 
that can be formed with two vector is their scalar product. Thus, the inter- 
action energy between two local spins can be described quite generally by 
the Heisenberg model 



n = -Y,,hj Si-Sj. (2.13) 

i<j 

The interatomic exchange coupling constants Jij depend on the microscopic 
origin of the interactions and may be positive, which favors ferromagnetic 
order (parallel spins), or negative, which favors antiferromagnetism (an- 
tiparallel spins). The general symmetry considerations behind the form of 
the Heisenberg interaction explains the success of the model for a variety 
of different physical situations. One may distinguish, for example, direct 
exchange interactions yielding usually Jij < 0, superexchange or RKKY 
indirect exchange in which the local spins couple through their interactions 
with the surrounding insulating or metallic electrons, and itinerant exchange 
as found when the electrons forming the spins moments Si are delocalized 
conduction electrons (metallic magnetism). 

In order to discuss a few important properties of ferromagnets we con- 
sider the mean-field approximation to equation (2.13). We focus on a par- 
ticular site j and approximate the interaction with the surrounding sites i 
using the average value of the magnetization {Si). The resulting mean-field 
Hamiltonian reads 



Umf 



Y.s,-Y,j,j{Si)- 

3 * 



Notice that this is a quite serious approximation, since it neglects the ex- 
change couplings Sf S~ + S~ S~j responsible of spin waves as well as short- 
range magnetic correlations ((5i--Sj) yf {Si)-{Sj)). Within this approach the 
effect of spin interactions can be regarded as an effective local magnetic field 






eff 



= ^J2j,^{S,) = —ZjJ{S.), 
9^J-B V 5MB 



(2.14) 



where Zj is the local coordination number at atom j {Jij = J only for NN’s). 
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The magnetization M is obtained by replacing H hy H + i?®® in 
equation (2.10): 



/c \ CD ( 9 ^J'■aSH + zJ{S^)S\ 

M = g^xB.{S^) = g9,B.SBs\ I- (2.15) 



The physical picture of the temperature dependence of the magnetization 
is the following. At low temperatures, FM order dominates since parallel 
ordering of local spins yields the lowest energy. As the temperature increases 
the spins start to disorder in order to gain the entropy of mixing the different 
spin configurations. Temperature-induced spin flips reduce {Sz) and tend 
to average out the exchange interactions which sustain long-range magnetic 
order (i/®® oc {Sz))- {Sz){T) decreases with increasing T vanishing at 

the Curie temperature Tq- If short-range magnetic order is neglected, all 
(25 -I- l)'^ spin configurations are equally probable for T > Tc, once (Sz) = 
0. The entropy of spin mixing, S = fcBA^ln(25-|-l), is therefore an important 
contribution to the entropy of a ferromagnetic system. 

In mean-field approximation the Curie temperature 



_ ^J5(5-H) 
3fce 



(2.16) 



can be regarded as proportional to the energy required to flip a spin from 
Sz = S to Sz = —S in the field iJ®® generated by z surrounding spins having 
Sz = S. Differentiation of equation (2.15) yields the magnetic susceptibility 
above Tc which follows the Curie- Weiss law 



X = 99B 



d{Sz) 

dH 



H=0 



g VI 5(5+1) 1 

3fcB T-Tc' 



(2.17) 



The divergency of y at Tq indicates the onset of long-range magnetic or- 
der. In real systems one observes that close to the phase transition y ex 
{T — Tc) with a critical exponent 7 yf 1. 

This closes our discussion of localized-electron magnetism. We have 
seen that the atomic like total moments J = |L±5|, preserve their identity 
in the case of localized 4/ electrons and that the crystalline environment 
modifies appreciably the magnetic behavior of TM ions, even in the case 
of an insulating matrix. The magnetic interactions between local moments 
have been described by the Heisenberg Hamiltonian, where the temperature 
dependence of the magnetization results from fluctuations of local degrees of 
freedom. Although successful in several respects, the local picture alone is 
certainly inappropriate for describing magnetic metals where the electrons 
are delocalized throughout the entire system (cluster or solid). The main 
features of itinerant magnetism are discussed in the following section. 
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2.2 Stoner model of itinerant magnetism 

The 3d electrons in TM solids are itinerant electrons that form bands and 
that contribute to the conductivity and to the low-T linear specific heat. It 
is therefore of considerable interest to discuss the sources of a ferromagnetic 
instability in a metallic system, particularly in view of the study of TM 
clusters. Stoner proposed a simple model which illustrates the effect of 
exchange interactions on itinerant electrons in a clear yet oversimplified way. 
The interaction energy of the electron gas is approximated by considering 
two different Coulomb repulsion strengths. The first one, denoted by U, is 
the average Coulomb repulsion between electrons having different spins and 
the second one, denoted by U — J, is the average repulsion between electrons 
having the same spin, which is reduced by the exchange integral J. For a 
given number of electrons per atom v = + ni and spin polarization 

— ni, the Coulomb interaction energy per atom is given by 

^ ^ ~ - J K - ■ (2-18) 



Exchange interactions tend to stabilize a ferromagnetic solution, i.e., — 

rii 7 ^ 0. This tendency to maximize {Sz) = \ (u| — nj^), that in the isolated 
atom leads to Hund’s 1st rule, is counteracted by the kinetic energy Ek 
associated to electron delocalization throughout the cluster or solid, which 
favors equal filling of spin up and down states. Notice that in the solid the 
spin of the atomic d-shell is no longer well defined as in the isolated atom. 
Nevertheless, the d-electron states remain the fundamental building blocks 
of the magnetic bands. 

In order to calculate Ek for given and n|, we occupy the lowest- 
energy k states having £fc < £f — A for spin down, and £fe < £p + A for 
up spin. Assuming that the paramagnetic density of states (DOS) per spin 
No{e) = — Ek) depends weakly on £ for |£ — £p| < A, we have 

= 2AA/o(£f) and 

Ek = E'^+ Afo(£F)A^ = • (2.19) 

The higher the DOS at £p is - i.e., the narrower the band is or the closer 
we are to the atomic limit - the smaller the kinetic-energy increase upon 
spin polarization. From equations (2.18) and (2.19) one obtains that a 
ferromagnetic instability occurs when 



A/o(£f) 



( 2 . 20 ) 



This is known as Stoner’s criterion for band ferromagnetism. Despite the 
limitations of the involved approximations (mean- field treatment of Eq, 
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rigid-band calculation of Ak, etc.) the Stoner criterion already reflects the 
fundamental interplay between kinetic energy and Coulomb energy which 
is so important in itinerant-electron magnetism. 

The self-consistent value of the magnetization M = fining — n-i) results 
from minimizing the total energy 



E = Eq + E}^ 




A/o(— e)] sde 



with respect to A (ep = 0)- In the solid, depending on the shape of the 
density of states and on the value of J, M{T = 0) may take any value 
from the non-magnetic case (n| = ni), over non-saturated ferromagnetic 
solutions (0 < n| — nj^ < v), up to saturated ferromagnetism (n| = jz, 
rii =0). This is a consequence of the delocalization of the electronic states 
and of the continuum DOS of the macroscopic metal. As we shall see, 
in the case of finite clusters M increases with J showing discrete steps 
AM = 2£/Aa with integer £. 

The explanation of non-integer values of the local moments at T = 0 is 
the mayor goal of Stoner’s theory and of the various mean-field approaches 
to ferromagnetism that were inspired by it. Moreover, extensions of the 
Stoner theory describe correctly the environment dependence of TM mag- 
netism in alloys, close to surfaces and clusters. In fact, we shall use such 
an approach to explain novel magnetic properties of clusters at T = 0, such 
as the enhancement of spin and orbital moments in Fe, Co and Ni clusters, 
the onset of magnetism in clusters of 4d TM’s such as Rh^r {N < 55) which 
is non magnetic in the bulk, or the size dependence of magnetic anisotropy 
energies. 

However, the straightforward extension of Stoner’s theory to finite tem- 
peratures - which would consist in taking into account the Fermi function 
in the calculation of the kinetic energy - fails completely to describe the 
behavior of TM solids, surfaces and clusters at T > 0. First, the magnetic 
susceptibility above Tc does not follow the Curie law as observed in all 
ferromagnetic TM’s. Second, at low T, M{T) decreases too slowly with 
increasing T: M(0) — M(T) oc instead of M(0) — M(T) oc 

Third, the calculated Curie temperatures are of the order of the T = 0 
exchange-splitting Asx ~ J — 1 eV ~ 10"^ K, i.e., an order of magnitude 
larger than those observed in experiments ~ 1000 K in TM’s). Fi- 

nally, the entropy around Tq is far too small, which indicates that the Stoner 
model fails to reproduce the mechanism for the disappearance of long-range 
magnetic order. Rather than the simplified form of the model Hamilto- 
nian, the main reason for these drawbacks is the mean-field treatment of 
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electron-electron interactions. In other words, Stoner’s approach does not 
take into account the fluctuations of local spin degrees of freedom and is 
thus unable to discern between the rather large exchange energy associated 
to the formation of local moments (Hund’s first rule) and the much smaller 
energy required to produce spatial fluctuations of the orientation of local 
spin polarizations (spin waves) . Therefore, we shall not stop here to discuss 
temperature dependent properties in the framework of Stoner’s theory. The 
interested reader should refer to [1] and [5] . We shall return to the problem 
of spin fluctuations in itinerant magnetism in Section 6.1, where we discuss 
the finite-temperature magnetic properties of TM clusters using a functional 
integral approach. 

2.3 Localized and itinerant aspects of magnetism in solids 

The localized and itinerant models of magnetism have diametrically opposed 
starting points. The former starts from electrons occupying atomic like 
states (localized in real space), while the latter starts from Bloch-like elec- 
trons which are spread out throughout the system (localized in reciprocal k 
space). They may be regarded as the two limiting cases obtained by varying 
the relative strength of the local Coulomb interactions {U and J) as com- 
pared to the kinetic energy of electron delocalization (bandwidth W). The 
local moment picture fits at best to 4/ or 3d magnetic impurities in insula- 
tors, while the band picture of magnetism is a good starting point for metal- 
lic ferromagnets having small magnetizations (~ O.I^b) and low Curie tem- 
peratures (~ 10 K). However, none of these approaches yields a satisfactory 
description of transition-metal solids. Some of the properties of TM’s can be 
understood within the local moment picture but not within Stoner’s model, 
and vice versa. For example, the non-integer values of the T = 0 magnetiza- 
tions [/ib(Fe) = 2.2pB, Mb(Co) = 1.7/iB and /Xb(Ni) = 0.6 ^b] and the strong 
environment dependence of the local magnetic moments close to surfaces, in 
Aims or alloys are well described by the band approach to magnetism. These 
are characteristics of itinerant magnetism which contrast with the environ- 
ment independent integer or half-integer values of the atomic-like angular 
momentum J or spin S. On the other side, the low-temperature de- 
crease of M{T) indicates the presence of low-energy spin-wave excitations. 
Neutron scattering experiments show the existence of local magnetic mo- 
ments with an appreciable degree of short-range magnetic order (SRMO) at 
temperatures above Tq. A Curie- Weiss susceptibility y ~ 1/(T — Tq) is ob- 
served for T > Tq, and the measured Curie temperatures are of the order of 
1Q3 K [Tc(Fe) = 1043 K,Tc(Co) = 1388 K,Tc(Ni) = 627 K]. None of these 
properties can be understood using a straightforward finite-temperature ex- 
tension of the band approach. In fact, Stoner’s theory predicts erroneously 
i) low-temperature spin-flip excitations across a gap of the order of 1 eV, 
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ii) k^Tc — JM{T = 0) ~ 10^ K, in) the disappearance of all local magnetic 
moments at T = Tq, and iv) no Curie- Weiss susceptibility at T > Tc- 
Summarizing, the mean-field approach to band magnetism is qualita- 
tively correct at T = 0 but fails at T > 0, whereas the local moment 
approach - obviously inappropriate for metals ~ fails at T = 0 but is qual- 
itatively correct as a picture for the finite-temperature excitations. These 
properties suggest the existence of local moments in magnetic TM’s. The 
duality between itinerant and local behaviors is an essential feature of TM 
magnetism which reflects the importance of electron correlations in these 
systems. A theoretical framework which unifies these two approaches was 
developed in the 70’s by applying functional-integral methods to single-band 
and d-band Hubbard models [6,7]. This is known as spin-fluctuation the- 
ory of itinerant-electron magnetism and shall be discussed later on in the 
context of finite-temperature properties of clusters. 



3 Experiments on magnetic clusters 

Theorists were the first to study the magnetism of clusters. Salahub et al. 
determined the magnetic moments of small 3d TM clusters using the X-a 
approximation [8]. It was predicted that the magnetic moment per atom 
fiN in small bcc-like Fbn clusters should be jig = 2.89^b and ^15 = 2.67^b, 
values that are considerably larger than the bulk moment /Zb = 2.2 /tb. 
These results were confirmed qualitatively by the local-spin-density calcu- 
lations of Lee, Callaway and coworkers who obtained p,g = 2.89/xb and 
p,i 5 = 2.93/iB [9]. An alternative to ab initio calculations was provided by 
the self-consistent tight-binding (SCTB) studies on Cr, Fe and Ni clusters 
(N < 51) [10]. This method yields results in good agreement with ab initio 
calculations [/l(FeAr) ~ 3.0/iB] and, taking advantage of the flexibility of the 
parameterized minimal-basis approximation, it explored a variety of cluster 
structures and sizes that remain even nowadays unaccessible to first prin- 
ciples techniques. The main predictions of these theoretical studies [8-10] 
were confirmed a couple of years later by Stern-Gerlasch measurements. 
The theoretical research on cluster magnetism was considerably boosted by 
these remarkable experimental findings [11,12]. An important number of 
calculations on magnetic TM clusters were then performed within the frame- 
work of the local spin density approximation, self-consistent tight-binding 
methods and many-body model Hamiltonians [13-16]. 

The first experiments on the magnetic properties of free TM clusters 
were performed by Cox et al. [17]. As in all the other gas-phase exper- 
imental studies that followed, the Stern-Gerlasch (SC) deflection is used 
to determine the magnetic properties. A typical experimental arrangement 
- see Figure 1 of [17] - involves three main steps: i) A beam of neutral 
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clusters containing a more or less broad distribution of sizes is produced, 
usually by using a laser vaporization source, ii) The neutral clusters pass 
through a SG magnet where they are deflected along the field-gradient direc- 
tion. The deflection depends on the value of the projection of the net cluster 
magnetization onto the magnetic held H . Glusters which magnetization is 
parallel (antiparallel) to H are deflected in the direction of increasing (de- 
creasing) held. Hi) Finally, the clusters are ionized in order to be detected 
after mass selection. 

Gox et al. observed that the beam intensity at the beam axis is depleted 
upon switching on the SG magnetic held. This was the first experimental 
indication that small Fcat clusters are magnetic (2 < TV < 17) [17]. The 
measurement were interpreted assuming that the magnetic clusters should 
be deflected in the external magnetic held H essentially like atoms carry- 
ing a large magnetic moment TV/ijv. This would correspond to an equal 
probability in the direction of increasing and decreasing held, since it is 
a priori equally probable to And parallel or antiparallel to H. How- 
ever, later experiments [11,12] revealed that this assumption was incorrect 
and that performing only on-axis detection (ionization) is a strong limita- 
tion. Measuring only the on-axis depletion factor is therefore insufficient 
for determining fijq. 

The next important experimental progress was achieved by de Heer 
et al. [11]. They performed SG experiments on FeA? (15 < < 650) and 

measured the cluster intensity all along the direction of the held gradient, 
which is perpendicular to the beam axis. In this way they discovered that 
the clusters deflect dominantly in the direction of increasing held, showing 
a somewhat broad spatial distribution (see Fig. 1 of Ref. [11]). This implies 
that the magnetization of isolated Fe clusters tends to align parallel to an 
external magnetic held, a remarkable effect which was not expected [11]. In 
this work, the average deflection D was related directly to the average mag- 
netic moment per atom fi by using the relation p, oc Dmv^ / H . However, it 
is actually the average magnetization of the cluster ensemble including re- 
laxation effects that should be related to D. The values of p derived in this 
way increase with increasing magnetic held H and are much smaller than 
the bulk atomic moments /ib. De Heer et al. already recognized that the 
cluster magnetic moments determined in such a way are a lower bound to 
the average magnetic moments of the cluster [8-10]. Due to the assumption 
p oc Dmv'^ / H the comparison between theory and experiment seemed quite 
controversial, also concerning the temperature dependence of D. It was not 
until the work of Bucher et al. [12] that the relation between the measured 
deflections D and the intrinsic cluster moment pN became clear. 

Bucher et al. performed experiments on Con clusters (20 < < 200) 

and showed that the observed small average deflections could be 
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interpreted as the result of relaxation of the magnetization of superparamag- 
netic clusters in the direction of H [12]. Theoretical analysis by Khanna and 
Linderodth [18] and by Jensen et al. [19] supported this interpretation. Su- 
perparamagnetic relaxation is a well-established phenomenon in the case 
of supported magnetic nanoparticles at temperatures above their blocking 
temperature. While the later is a consequence of thermal average of the ori- 
entation of magnetic moments in contact with a macroscopic environment, 
the mechanism of spin relaxation in free clusters is not yet completely un- 
derstood [20]. Assuming that the clusters undergo superparamagnetic re- 
laxation, Bucher et al. inferred intrinsic magnetic moments per atom which 
are larger than the bulk moment [P.{Con) = (2.1 ± 0.2)/tb] in agreement 
with existing calculations [8-10]. The superparamagnetic regime opened the 
possibility of a series of systematic experimental studies [21-26]. RhAr clus- 
ters were found to have rather large magnetizations [23] . This was the first 
experimental observation of a transition from non-magnetic to magnetic be- 
havior upon reduction of system size. Theoretical results already indicated 
this possibility [14, 27, 28] . Experiments on clusters of other non-magnetic 
TM’s (Vat, CrAT and PdAr) have yielded no measurable SG deflection so 
far [22]. 

Billas et al. determined the magnetization of Foat clusters as a function 
of size and temperature for 25 < N < 700 and 100 K < T < 900 K [24]. 
They obtained that the low-temperature average moment ftN increases with 
decreasing cluster size, exhibiting some oscillations as a function of N and 
reaching a value of about 3/iB for the smallest sizes (T ~ 100 K). A similar 
behavior is also observed for Co and Ni clusters. Stern-Gerlasch measure- 
ments on NIat clusters have been also reported in [26]. However, there are 
still some quantitative discrepancies between the of Mat obtained in 
different experiments [25,26]. 

The experimental results for the temperature dependence of the average 
magnetization per atom p,f^(T) are far from simple and depend significantly 
on the considered TM [25]. In Mat the magnetization curves are quali- 
tatively similar to the bulk, except of course for an important finite-size 
broadening of the transition around the Curie temperature Tq. Experi- 
ments on Coat show that the magnetization per atom is about 0.1— 0.5/iB 
larger than the bulk fJ.b{T) for 50 < N < 600 and 100 K < T < 1000 K [25]. 
At low temperatures, 100 K < T < 500 K, the magnetization of Con is 
found to increase slightly with T. This is an unexpected effect that is not 
observed in the solid. Note that these temperatures are significantly lower 
than the bulk Curie temperature Tc(Co) = 1388 K. In Fe clusters the 
temperature dependence is qualitatively different from that of Ni or Co 
clusters. For 250 < N < 600 one observes a rapid, almost linear decrease 
of the magnetization with increasing T (T < 500—600 K). For T > 300 K, 
P,n{T) is smaller than in bulk M{T) although at T = 0 it was larger 
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[Tc(Fe-bulk) = 1043 K]. As the cluster size increases (250 < N < 600) 
/ijv(T) decreases further making the difference between cluster and bulk 
magnetizations even larger [25] . One expects that this trend should change 
at larger sizes, although no experimental evidence is available so far. 

In addition to the values of the intrinsic magnetic moments of isolated 
clusters, the Stern-Gerlasch beam experiments yielded remarkable results 
for the magnetic behavior of a cluster ensemble [11,12]. Several theoreti- 
cal studies were motivated by the asymmetric magnetic deflection and by 
the magnetic-field dependence of the average magnetization of the cluster 
ensemble which shows strong deviations from Langevin behavior [18-20]. 
Assuming superparamagnetic relaxation, the experimental findings could 
be analyzed consistent with electronic calculations of ft n. However, a mi- 
croscopic understanding of the remarkable spin relaxation process in an iso- 
lated cluster deserves further study. The experiments on Fon in a magnetic 
held also revealed the breakdown of the superparamagnetic regime in the 
case of supersonically cooled clusters (see Fig. 1 of Ref. [24]). This has been 
interpreted as the result of a resonant coupling between the Zeeman levels 
that is induced by the rotations of the cluster in the magnetic held [20,24]. 

Experiments on clusters embedded in a macroscopic matrix or deposited 
on surfaces are of considerable interest in view of applications in storage and 
recording technologies. From a fundamental standpoint, they provide com- 
plementary information to studies of free clusters with several experimental 
advantages, such as a precise control of the sample temperature including 
the possibility of cooling down to very low temperatures, both of which are 
difficult in the gas phase. Large densities of clusters in colloids, clusters 
codeposited in a matrix or deposited on surfaces can be produced which 
can be investigated by powerful condensed-matter techniques such as mag- 
netization measurements, Mossbauer spectroscopy [29] and X-ray magnetic 
circular dichroism [30,31]. However, note that in these cases the hybridiza- 
tions, distortions and bond-length changes induced by the interactions with 
the support often result in magnetic properties that may differ considerably 
from those of free clusters. 

The study of supported clusters emphasizes the close relations between 
the magnetic properties clusters, nanocrystals, nano-structured materials, 
thin Aims and surfaces. For example, experiments on Go nanoparticles 
(1.5—2 nm diameter) in colloidal solutions [32] show an enhancement of 
the mean magnetic moment per atom of about 20% with respect to the 
solid and magnetic anisotropies that are much larger than in the bulk [29]. 
For 2-layer-high Go islands deposited on Au(lll) [30] one also observes an 
enhancement of the magnetic anisotropy energy per atom and of the orbital 
magnetic moments as the cluster size is reduced. Fe clusters deposited on a 
graphite surface (HOPG) with sizes ranging from 1—5 nm (40—5000 atoms) 
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present orbital moments (L) that are twice as large as in the bulk (Ab — 
0.1 /ie) [31]. These results are in agreement with theoretical predictions on 
small clusters [33]. 

4 Ground-state magnetic properties of transition-metal clusters 

In this section we introduce the electronic models relevant for studying 
the magnetic properties of low-dimensional TM systems. The ground-state 
properties of clusters are investigated by using a realistic spd-band model 
Hamiltonian and a mean-field treatment of electron-electron interactions. 
The size and structural dependence of several magnetic properties are ana- 
lyzed and the microscopic origin of experimental observations is discussed. 
A rigorous treatment of electron-correlation effects is developed in Section 5. 

4.1 Model Hamiltonians 

We consider the non-relativistic Schrodinger equation for the valence s,p 
and d electrons and we expand the field operator ipo-ir) = (f’iair) Ciaa 
in a set of orbitals 4>ia{r) centered at each atom i. 4>ia{r) = 4>a{r — Ri) 
where Ri refer to the position of atom f, and a denotes the different s,p 
and d orbitals. The Hamiltonian can be written as 

H = Ho + V, 

'^o = J2 

ict 

313 

aa' ijkl ctfS-yS 

where 

ttl' = I 

is the transfer or hopping integral between the /3-th orbital at atom j and 
the ath orbital at atom i. refers to the single-particle energy level 

of the orbital ia. 

The Coulomb integrals 

= I d^rd^r'^o^ir- R^)rpir- 



2m 






(j)/3{f- Rj)d^r 



(4.1) 

(4.2) 

(4.3) 



represent transitions of a pair of electrons from at the orbitals fcy and IS to 
the orbitals ia and j/3. 
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The intra-atomic terms in V are the dominant ones for the magnetic 
properties. As already discussed, they are responsible for the Hund-rule 
correlations that lead to the formation of local magnetic moments in the 
open d shell. Therefore, in first approximation, we may treat explicitly 
only the intra-atomic Coulomb terms (i = j = k = 1) hy including the 
interatomic Coulomb contributions as a mean-field correction to the single- 
particle potential T4xt = K;ore + Vinter- In this way the interatomic terms 
screen quite efficiently the potential generated by the ions beyond the 1st 
or 2nd nearest-neighbor shell. An explicit treatment of interatomic contri- 
butions becomes important for heterogeneous clusters such as TM oxides 
where site-dependent charge transfers occur. 

The Coulomb integrals are roughly proportional to the product 

of the overlap between (j)ia and and between <j)jp and 4>k-y Thus, we 
may simplify the interaction Hamiltonian further by retaining explicitly 
only the terms involving at most two different orbitals, namely, the direct 
terms having ia = 15 and jj3 = ky, and the exchange terms having ia = ky 
and jP = IS (two-center approximation). Thus, the interaction term takes 
the form 



^ ^ ^ \ ^ ^ Uoi^Tlio(a^if3<7 ^ ^ (^a/3 Ja.^^ 



0.(3 



— J, 



(4.4) 



a^/3 



where Ca/? = and Jaf} = are, respectively, the direct and 

exchange Coulomb integrals between the orbitals a and P of atom i. For 
homogeneous clusters they are independent of i. Ja /3 is responsible for 
Hund’s first rule in the atom. With present computer facilities it is possible 
to perform mean-field ground-state calculations on small clusters using com- 
plex multiband models that treat all s,p and d valence electrons explicitly 
{e.g., Hartree-Fock or density functional theory in the local spin density 
approximation). The situation changes completely as soon as electron cor- 
relation effects or finite-temperature properties are addressed. In this case 
it is necessary to simplify the valence-electron dynamics further by focusing 
on the d orbitals responsible for magnetism. Comparative studies between 
spd and d models at T = 0 show, as expected, that the magnetic properties 
of TM systems are largely dominated by the d valence electrons. Therefore, 
one often restricts a and P in equations (4.2) and (4.4) to the 3d states. 
Replacing for simplicity Uap and Jap by their average values e°, U and 
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J, one obtains 

^“/4a<rCi/3<T (4.5) 

ioccr 

Ctl3(T 

and 

C = 2 Uaa'niaani/Sa' ~ 2 '^ > (4-6) 



where [7^^ = C, = U — J, and sf^ = c\^-^ Ciai- Notice that the self- 
interaction term a = f3 has been included in the first sum by replacing 
by e° — [/(T(t/ 2 in Hq. This model will be used in the following to investigate 
the size and structural dependence of a wide variety of magnetic properties 
of clusters. 



4.2 Mean-field approximation 

Introduce c-numbers lyiaa and G C, = Cia and rewrite V 

(Eq. (4.6)) using 

4“ (^iatr ^iaa^{.dip(7' r^z/3cr' ) 



and 



as 



ia^ip Cia^ip 4 ” Cip^ia ^ia^ip (^ia ^ia)(^ip ^ip) 



V = Vi-hV2, 

bl = y ] I y ] Uap l^ipa' I Tliaa 

iOC<7 Y 0(7' 



-E 



\P^a 



^ta + 






\0¥=a 



2 E! ^aP ^iaal^iPa' 4" E! '^oipCtaCip^ 

0.0 0:^/3 

^2 — ^ ^ ^o0 (.^10(7 ^zckct) {_‘^i0<7' ^i0a'^ 



E - Ci(.) [hp - Qp) + h.c. 
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The mean-field approximation is obtained by neglecting the fluctuation (or 
correlation) term V 2 and by choosing lyiaa- and such that the ground-state 
energy of 7ii = Tio + Vi is minimized. The conditions 



d(ni) 

r)u- 

^^lOC<7 



= 0 



and 



d{Hi) 

dC^ 



= 0 



yield the usual selfconsistent equations 



^ia<7 — {‘^iaa') 

C = (si) (4-8) 

where (•••) implies average over the ground state of Tii. For simplicity 
we assume in the following that the magnetic moments are collinear, i.e., 
Cili = 0, V ia. The collinear state is always a selfconsistent solution since, if 
Cia = the eigenstates of Hi are products of spin-up and spin-down states, 
and then (s4(,) = 0. However, it is sometimes possible to find lower-energy 
solutions having non-collinear arrangements of the local spin polarizations. 
This is the case when magnetic frustrations are present, for example, in 
antiferromagnetic compact clusters (see Sect. 5.4 and Refs. [34-36]). 

In the mean-field collinear approximation the Hamiltonian is given by 

^ ~ Siaa-niaa + (4.9) 

i(X(7 

Ct(3<7 



with 

^iaa — ^ict T ^ ] iUa(i ^ip ^ ^ JapP'ip- (4-10) 

/3 P 

Here, Vijj = {hip^^) + {fiipi) and pLip = (h*/ 3 |) - {fiipi) refer, respectively, to 
the average occupation and spin polarization of orbital ij3. 

The average occupation numbers (hiaa-) are determined self-consistently 
from 



■^iaa (^)d£:, 



where 

— fell (^) } 

7T 

is the local density of states (LDOS) per spin orbital and Giaa,iaa{s) the 
local Green’s function [G = (e — H)~^]. A particularly efficient way of 
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determining Afiaais) in systems lacking translational symmetry (free and 
deposited clusters, surfaces, thin films, etc.) is the Haydock-Heine-Kelly’s 
recursion method [37], which expresses Giao-,iaa-{^) a local expansion 
around each atom i of the cluster. This local approach is physically very 
appealing since it stresses the role of the local atomic environment and its 
changes as function of cluster size or as we move from the interior to the 
surface of the cluster. 



4.3 Second-moment approximation: Enhancement of local and average spin 
magnetic moments 



The lowest-order local approximation to Afiaa-is) is the second-order approx- 
imation which is obtained by taking into account only the contributions to 
Gia<T,iao-(e) from the nearest neighbors of atom i. Dropping the orbital 
dependence (C/a /3 = U, Jap = J) and averaging over a one obtains 






10 L _ (e - \ 

TTWi \ wf / 



(4.11) 



where = {zijzb)wl is the second moment of the local DOS^. Wi = 
\fz^J^w\:, plays the role of an effective local band-width that depends on 
the local coordination number Zi at the cluster atom i. Zb refers to the 
coordination number in the bulk (zb = 12 for an FCC lattice, for example) 
and Wh to the bulk band- width. 

As illustrated in Figure 1, the reduction of local coordination number 
results in a reduction of the local effective band width and thus in an en- 
hanced paramagnetic density of states A/io(£F) at the Fermi energy. The 
resulting self-consistent magnetic moments are given in Figure 2. This very 
simple approximation already explains qualitatively various major trends in 
low-dimensional TM magnetism: 



i) the local spin magnetic moments pi increase as the local coordination 
number decreases since the kinetic (promotion) energy is smaller for 
larger Afiais) (Stoner’s criterion). Thus, open surfaces show larger 
local moments than closed ones [38]; 

a) the average spin moment per atom piq of an A^-atom cluster increases 
with decreasing N since (z/) decreases as the cluster size is reduced. 
Moreover, the enhancement of pN originates at the cluster surface 
where z/ is smallest; 



^The second moment oiJ\fiaa{e) is - eiaa)'^JJiaa{£i)Ae. Setting eia^ = 

0, we have \{k\iaa)\'^ = {iaa\H'^\iaa) = = Gi/zb)wl, 

where and the second moment and local coordination number in the bulk. 
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Fig. 1. Local density of states A/”/q^(£) in the second-moment approximation for 
Zb = 12 and Zi = 5 (Eq. (4.11)). The Fermi energy ep = 0 corresponds to Fe 
(na = 7). 




Fig. 2. Selfconsistent magnetic moments as a function of J/wt as obtained by 
using the second-moment approximation to the local density of states. 

iiij bond-length contraction results in a reduction of /j-i and /2n since 
for shorter interatomic distances the hopping integrals and thus Wh 
increase; 

iv) it is in principle possible that some TM’s which are non-magnetic in 
the bulk could develop non-vanishing magnetizations if the system 
dimensions are reduced. This has been observed experimentally in 
the case of RhA? clusters [23]. 
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Of course, such a simple structureless LDOS cannot explain many other 
important properties. Equation (4.11) fails to reproduce that in clusters 
fiN can only adopt discrete values due to the discreteness of the electronic 
spectrum in a finite system. The same holds for the changes in the local 
spin moments observed even for atoms having a complete NN shell below 
the cluster surface. Equation (4.11) is also unable to distinguish between 
different magnetic orders which may depend on the cluster structure. These 
are just a few examples. Nevertheless, despite these drawbacks, the 2nd mo- 
ment approximation remains an illustrative start point for a more rigorous 
analysis of cluster magnetic properties from a local point of view. 

4.4 Spin magnetic moments and magnetic order 

The local magnetic moments p{i) at the different atoms i, the magnetic 
order and average magnetic moment per atom /Ijv of the cluster are de- 
termined using the self-consistent tight-binding (SCTB) model given by 
equations (4.9) and (4.10). The element-specific parameters required for 
the calculations are estimated as follows. The interatomic hopping inte- 
grals for d electrons are given by the canonical two-center approxi- 
mation taking into account the dependence on interatomic distances Rij: 
dd{a,7T,S) = (— 6, 4, — l)(lTd/2.5)(i?o/-Rii)^) where Rq is the bulk NN dis- 
tance. This involves one scaling parameter, namely, the bulk d band-width 
Wd (for example, Wd(Fe) = 6.0 eV and TTd(Rh) = 7.4 eV). Other more 
sophisticated choices may be considered, for instance, can be fitted 
to local-spin-density band-structure calculations for the solid. However, 
this does not affect the main physical conclusions. The direct Coulomb 
integrals U are estimated from atomic data, e.g., Udd(Pe) = 6.0 eV and 
C/d(i(Rh) = 7.8 eV. These values yield a density distribution close to lo- 
cal charge neutrality so that the results of SCTB calculations are not very 
sensitive to the precise value of C/ [10]. The main parameter for the determi- 
nation of magnetic properties is the d-electron exchange integral Jdd, which 
is usually fitted to the bulk moment pb sd T = Q {e.g., Jdd(Fe) = 0.70 eV). 
For non-magnetic TM’s such as Rh the exchange integrals are obtained from 
local-spin-density (LSD) Stoner theory [39] by taking into account a reduc- 
tion of 20% in order to account for correlation effects [40] . In the case of Rh 
this yields Jdd(Rh) = 0.48 eV [41]. Applying the same procedure to Fe one 
obtains Jd(i(Fe) = 0.71 eV, which is very close to the value derived from the 
bulk magnetization. 

4.4.1 Free clusters: Surface effects 

The onset of magnetism in clusters and its relation to the electronic struc- 
ture can be analyzed systematically by determining the magnetic properties 
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Fig. 3. Local magnetic moments fi{i) and average magnetic moment per atom 
p .15 (full curve) of Feis with bcc-like structure as a function of the ratio between 
the d-electron exchange integral J and the bulk d band- width W (see Ref. [10]). 
i = 1 refers to the central atom, i = 2 to its 1st NN’s and i = 3 to the outermost 
shell of the 2nd NN’s of the central atom. The corresponding bulk magnetization 
/ib is also given. 



as a function of the ratio between the intra-atomic fi-electron exchange in- 
tegral J and the band width W of the corresponding solid. In Figure 3 the 
local magnetic moments ^(z) and the average magnetic moment per atom 
fiN of a bcc-like Fei 5 cluster are given as a function J/W. The variations 
of J/W can be regarded as resulting from changes in the interatomic dis- 
tances Rij {e.g., for TM’s, W oc tij oc R/j^) or from changes in the spatial 
extension of the d orbitals for different elements within the same group. 

One observes that magnetism sets in for J > J^, where the critical 
value Jc depends on size and structure. In clusters Jc is smaller than the 
bulk Stoner value Jc = 1 /A/o(£f)- The cluster magnetic moments are thus 
larger than the bulk magnetization. This is a consequence of the reduction 
of the local coordination numbers as the size of the system decreases [10] 
in agreement with cluster experiments and well-known surface properties 
[12,24,25]. Larger clusters show a similar behavior. 

A particularly interesting result is that the clusters can be magnetic for 
values of J/W for which the macroscopic solid is non-magnetic. A tran- 
sition form paramagnetic to ferromagnetic behavior is therefore possible 
by reducing the system size. Indeed, this phenomenon has been experi- 
mentally observed in the case of Rh clusters [23] . For more recent specific 
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calculations on RliAr see [28,41,42]. At this point it is important to recall 
that the Hartree-Fock approximation tends to overestimate systematically 
the stability of ferromagnetism and the formation of magnetic moments 
due to an overestimation of the energy of non-magnetic states. Taking into 
account electron correlation effects is therefore crucial for a profound and 
systematic understanding of the magnetic behavior of 4d TM clusters. This 
problem is discussed in more detail in Section 5 by applying exact diago- 
nalization methods to the single-band Hubbard-model. 

As J is increased beyond Jc, the average magnetic moment per atom 
increases by discrete steps A/xat = 21 /N, where I is the number of electrons 
whose spin is flipped. The resulting redistributions of the spin-polarized 
density as a function of JjW are accompanied with strong changes in the 
local magnetic moments Close to a spin flip, the local densities of 

electronic states present large peaks at the Fermi energy ep, which indicates 
that the states changing occupation are highly degenerate. The electronic 
structure is thus very sensitive to the interatomic distances (see Figs. 3 
and 4 of Ref. [14]). 

Concerning the magnetic order within the cluster one finds that bcc-like 
clusters are in general ferromagnetic ]/i(z) > 0, Vf]. However, one sometimes 
observes, in a limited range of small values of JJW, that the local magnetic 
moments at the center of the cluster point opposite to the average magne- 
tization. In fact, for unsaturated magnetism, fj,(i) depends very strongly 
on the atomic position i. In particular for Rhjv this results in a remark- 
able size dependence of local and average magnetic moments [42] . The local 
spin moments tend to increase as we go from the center to the surface of the 
cluster. Therefore, the enhancement of the average spin magnetization of 
the cluster is dominated by the surface contributions. As already discussed, 
this can be qualitatively understood as a consequence of the reduction of 
the local coordination number and of the effective local d-band width [10]. 

The structural dependence of the magnetic properties can be illustrated 
by comparing the behaviors of clusters having fee- and bcc-like structures. 
In contrast to bcc clusters, fee Fe clusters present antiferromagnetic-like 
order, and thus much smaller average moments pN [10,14]. The presence 
of antiferromagnetic order in compact structures raises quite naturally the 
question of magnetic frustrations and possible non-collinear spin arrange- 
ments in finite systems. Such more complicated magnetic structures have 
been explored in the framework of the single-band Hubbard model [35] (see 
Sect. 5.4) and are expected to be important in Cr^v as well as in mixed 
clusters like (Cra,Fei_a;)Ar [43]. The remarkable role played in free clusters 
by the surface atoms let us expect a very interesting physics at the interface 
of embedded clusters. 
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4.4.2 Embedded clusters: Interface effects 

The magnetic properties of clusters are a subject of considerable interest 
in materials science in view of possible applications in magnetic recording 
and storage technologies. For many practical applications involving clus- 
ters, these are not isolated but embedded in a matrix or deposited on a 
surface. It is therefore important to extend our analysis of cluster mag- 
netism to situations where the clusters are in contact with a macroscopic 
environment. The studies on free and embedded clusters yield complemen- 
tary informations which contribute to the characterization of the various 
specific behavior of these novel materials. 

In this section we consider the magnetic properties of Fe clusters embed- 
ded in a Cr matrix. These materials are very interesting since they show 
different magnetic behaviors which are very sensitive to the structural and 
chemical environment of the atoms. The competition between the antiferro- 
magnetic order of the Cr matrix and the tendency of Fcat clusters to order 
ferromagnetically leads to particularly strong cluster-matrix magnetic in- 
teractions. Therefore, one expects to encounter a variety of novel magnetic 
effects. The problem is to some extent similar to that of Fe/Cr multilayers, 
which deserved considerable attention in past years. 

In Figure 4 results are given for the local magnetic moments of Foat 
clusters embedded in a Cr matrix as substitutional impurities [44, 45] . The 
main conclusions are summarized below: 

i) the magnetic moment of the atomic impurity, /i(l) = 0.26/iB, is dras- 
tically reduced with respect to the bulk magnetization /ib = 2.2 ^b 
of pure Fe. This can be interpreted as a consequence of the smaller 
exchange splittings of the Cr atoms with which the Fe-impurity or- 
bitals hybridize strongly. Moreover, the local magnetic moment at the 
Fe impurity is antiparallel to its NN’s, i.e., the magnetic coupling is 
consistent with the spin-density-wave state of the matrix; 

ii) as long as the size of the embedded cluster is very small {N < 4), 
the magnetic order within the cluster is antiferromagnetic with 
strongly reduced local magnetic moments /i(t). In other words, very 
small Fe clusters adopt the magnetic order induced by the spin-density 
wave of the matrix. This contrasts with the behavior observed for 
free clusters which show ferromagnetic order and /i(z) > /ib (see 
Sect. 4.4.1); 

Hi) for N > 6 one observes the expected transition from antiferromagnetic 
to ferromagnetic order within Fe n . This is followed by a considerable 
increase of the local magnetic moments at the Fe atoms, although 
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Fig. 4. Spatial distribution of the local magnetic moments /i(i) for Fejv clus- 
ters {N < 51) embedded in an antiferromagnetic Cr matrix. The non-equivalent 
atomic sites i are ordered by increasing distance to the central Fe-atom. Dots 
correspond to Fe atoms and crosses to Cr atoms. Notice the transition from anti- 
ferromagnetic to ferromagnetic order within Feiv for N = 4—6, and the decrease 
of fi{i) at the Fe atoms close to the Fe-Cr interface. 



close to the interface with the matrix the ^(z) are still smaller than 
fXb (see Fig. 4); 
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iv) at the interface there is a strong tendency to antiferromagnetic cou- 
pling between Fe and Cr moments. The magnetic order among Cr 
atoms, given by the sign of is not affected by the presence of 
the Fe cluster, even in cases where the shape of the cluster prevents a 
perfect antiferromagnetic coupling at the interface. However, one ob- 
serves important quantitative changes in \fJ.(i)\ at the Cr atoms close 
to the cluster; 

v) the magnetic moments ^Fe(*) of the Fe atoms are very sensitive to the 
local atomic environment. /iFe(*) is found to be roughly proportional 
to the number of Fe atoms 2pe(*) found in the 1st NN shell of atom i 
[/iFe(*) = 0.61/tb, 0.99/iB, 1.29 ^b and 1.42— 1.59/tb for ZFe(f) = 1—4, 
respectively]. For larger clusters (N > 9), the magnetic moment at 
the center of the cluster is quite close to the Fe-bulk value. However, 
the average magnetization per atom is always smaller than /Zb(Fe) due 
to the contributions of interface Fe atoms. The trend is thus opposite 
to that of free clusters or infinite surfaces; 

vi) the electronic densities of states reflect very clearly the cluster-matrix 
hybridizations and the transition from antiferromagnetic to ferromag- 
netic order within FeAr. Small FeAr clusters in Cr {N < 4) present an 
electronic structure which is quite similar to that of pure Cr. Larger 
Fe clusters {N > 9) show a ferromagnetic-like exchange splitting and 
the first signs of convergence towards Fe-bulk behavior. Nonetheless, 
the changes induced by the cluster-matrix interactions remain signif- 
icant [44,45]. 

Fe clusters embedded in a Cr matrix have many properties in common with 
Fe/Cr multilayers. The antiferromagnetic order in the Cr even close the in- 
terfaces, the antiferromagnetic coupling between Fe and Cr moments at the 
interface, the strong reduction of the local magnetic moments at Fe atoms 
close to the interface, the possibility of a slight enhancement of the Fe mo- 
ment beyond the bulk value at the center of a cluster or film, the strong 
changes of associated to frustrations, are all characteristics shared by 
Fe-Cr systems both in the form of embedded clusters or multilayer struc- 
tures [44]. One concludes that, at least in this case, the immediate local 
environment of the atoms gives the dominant contribution to the magnetic 
behavior. Nevertheless, for quantitative predictions a detailed determina- 
tion of the electronic structure and the geometrical structure at a larger 
length scale are certainly important. 

The results discussed in this section have shown that magnetic properties 
of Fe particles embedded in a matrix are qualitatively different form those of 
free (unsupported) clusters. For Foat in Cr or V the changes are so dramatic 
because the magnetic clusters interact very strongly with the TM matrix. 
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In the case of inert matrices or colloidal solutions the properties of embedded 
clusters often resemble those of free clusters in many respects. Clusters on 
surfaces experience both the reduction of the local coordination number, as 
free clusters, and the cluster-substrate interactions, as embedded clusters. A 
variety of very interesting magnetic behaviors results from these competing 
effects. Research in this direction is a subject of mayor current interest. 

4.5 Magnetic anisotropy and orbitai magnetism 

The electronic and magnetic properties of ferromagnets depend to some 
extent on the orientation of the magnetization with respect to the crystal 
structure and to the external shape of the system. The anisotropy of the 
ground-state energy (or of the free energy at T > 0) is one of the main 
characteristics of a magnetic material. It determines the low-temperature 
orientation of the magnetization with respect to the geometry of the system 
and the stability of the magnetization direction in the case of single-domain 
particles. The magnetic anisotropy energy (MAE) - defined as the energy 
difference involved in changing the magnetization direction from the low- 
energy direction or easy axis to a high-energy direction or hard axis - is a 
property of crucial importance in technological applications {e.g., magnetic 
recording or memory devices) where the magnetization must be pinned to 
a given direction in space. In the case of clusters, the magnetic anisotropy 
deserves a special attention, not only from a purely theoretical point of view 
but also because of its implications in cluster-beam Stern-Gerlasch experi- 
ments [11,12]. In this section we consider the main relativistic corrections 
at the origin of magnetic anisotropy in transition metals and we discuss the 
size and structural dependence of the MAE and orbital magnetic moments 
L. In particular we investigate the transition from large atomic-like L, given 
by Hund’s second rule, to bulk-like quenching of L as a function of cluster 
size. 

4.5.1 Relativistic corrections 

So far we have discussed the magnetic properties of clusters at a purely 
non-relativistic (NR) level taking into account of course that the electrons 
are spin-1/2 fermions. In non-relativistic quantum mechanics the total spin 
operator S = Si commutes with the Hamiltonian operator and therefore 
the direction of the magnetization relative to the cluster structure plays no 
role in the electronic properties. Thus, in the absence of an external mag- 
netic field the NR ground-state and excited-state energies are independent 
of the direction of the magnetization. The magneto-anisotropic behavior 
of magnetic materials is the result of relativistic corrections to the single- 
electron dynamics (Dirac equation) and to the electron-electron interaction 
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(Breit interaction) (see, for instance, Ref. [46]). The leading contributions 
are of second order in the ratio vjc between the velocity of the electrons 
and the velocity of light. 

The relativistic corrections to the electron-electron interaction have two 
origins. First, the electrons are moving charges and therefore each electron 
interacts with the magnetic field generated by the current and by the spins of 
the other electrons. Second, the electromagnetic interactions are mediated 
by photons which travel at the speed of light and are thus affected by 
retardation effects. In the Hartree approximation to the Breit interaction, 
the magnetic dipole-dipole energy is given by [46,47] 
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where fh{r) = {fh{r)) is the magnetization density. For transition metals 
we may replace equation (4.12) by the sum of the dipole-dipole interactions 
between the local magnetic moments jli at each atom i\ 
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The dipole-dipole interaction energy between pairs local moments is very 
small even at NN distances (Sdd — 3/r eV for Rq ~ 2.5 A and ^ ~ l/ie)- 
However, E^d decreases slowly as a function of distance {Eud ~ ^/Rij) 
and the summation over pairs ij converges very slowly. Therefore, £1 dd 
depends on the shape of the nanoparticle. In the case of small clusters the 
main contribution to the magnetic anisotropy energy results from spin-orbit 
interactions. 

The other relativistic corrections to the Schrodinger equation come from 
the single-electron dynamics which is ruled by the Dirac equation. The non- 
relativistic limit of the Dirac equation - including the terms up to the order 
(v/c)^ - is the Pauli Hamiltonian Tfpauii = + R-SO [46]. One 

distinguishes, on the one side, the non-relativistic (Schrodinger) term TYnr 
and the scalar relativistic corrections TisRj usually included in the external 
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(pseudo)potential that defines the hopping integrals (Eq. (4.2)). On the 
other side, one finds the spin-orbit interactions 

= (4,13) 

which are qualitatively important since they modify the symmetry of the 
wave function. The spin-orbit coupling represents the interaction of the 
magnetic moment of the electron with the magnetic field that results from 
the electronic motion relative to the lattice potential V(f). In other words, 
the magnetic moment of the electron interacts with the magnetic field that 
is generated by V (r) as seen in the reference frame of the electron. Since 
the electric field is strongest close to the nuclei, the SO effects are most 
important for the heaviest elements and for the most localized orbitals 
(core-electron states and d or f valence orbitals). Approximating V by 
its spherical average, one may write 

dV ^ 

'Hso = 0 2 2 a{r)£- s. 

Zm^c'^r or 

For the d-electrons in transition metals it is a good approximation to con- 
sider only the intra-atomic terms. Thus, Tfso may be expressed as 

Tdso = i E (•^ * a' C^im' a' i 

where ^ = f a(r) Rsdir) dr is the spin-orbit coupling constant (^ > 

0), and {£ ■ ^m<T,ni'cr' are the matrix elements of £ • s among the d states. 

A simple physical picture for the MAE can be derived in the limit of 
saturated magnetic moments with large exchange splitting Sidj — Sidi = 
J fd [48]. In this case, at lowest-order perturbation theory, we may neglect the 
terms that mix spin-up and spin-down states. Thus, TLso — (^/2) ~ 

£izi) can be regarded as a magnetic field acting on the orbital moment £iza- 
Moreover, (7Yso)/-^a = (C/2) ((Azt) “ (^zi)) where {L^„) is the average 
orbital moment per atom. Taking the up spins as the majority ones, we 
have {Lzi) — 0 for Ud < 5, and (A^j) ~ 0 for Ud > 5. Thus, {'Hso)/^a — 
±(^/2)(L2) where the -I- (— ) sign corresponds to < 5 {ud > 5). As in 
the atom, antiparallel (parallel) alignment of (A^) and (Sz) is favored for 
rid < 5 {nd > 5). The lowest-energy magnetization direction (easy axis) 
is the one yielding the largest orbital moment. The spin magnetization is 
“turned” to the direction yielding the largest (A). 



4.5.2 Magnetic anisotropy of small clusters 

In Figure 5 results are given for the MAE AE, the orbital angular momen- 
tum {Ls) along the magnetization direction S (S = x, y, z), and the average 
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Fig. 5. (a) Magnetic anisotropy energy (MAE), (b) orbital magnetic moment 
(Ls) and (c) spin magnetic moment (Sz) of Fe 4 with rhombohedral structure as 
a function of the bond length d (dn = bulk NN distance). The magnetization 
direction 5 = ® is along the middle bond, the direction 5 = y is perpendicular to 
X and within the plane of the cluster, and 5 = 2 is perpendicular to x and y. 



spin projection {Sz) of an Fe 4 cluster with rhombohedral structure [33]. 
The two-center hopping integrals and the intra-atomic Coulomb integrals 
used in the calculations are the same as in previous sections. The value of 
the spin-orbit coupling constant corresponding to Fe is ^ = 0.05 eV [33]. 
The results are given as a function of the bond length d in order to ana- 
lyze the role of cluster relaxation and to infer the possible coupling of the 
magnetization direction to vibrations and distortions. Only {Sz) is shown, 
since the magnitude of the spin magnetization [(S')] depends very weakly on 
the considered direction (typically, [(^z) — (S'x)] ~ 10“^-10“^). Comparing 
Figures 5a and 5c it is clear that the variations of the MAE are related to 
the variations of {Sz) and to the resulting changes in the electronic spec- 
trum. For large values of d/dB the spin magnetic moments are saturated, 
i.e., {Sz) — (10 — r'd)/2 = 3/2. When d/ds decreases, discrete changes in 
the spin polarization occur and non-saturated moments are obtained (see 





368 



Atomic Clusters and Nanoparticles 



Table 1. Size and structural dependence of the magnetic anisotropy energy 
(MAE) of Feiv clusters. The off- plane MAE AE = Ex — Ez and the in-plane 
MAE AE = Ex — Ey (results in brackets) are given in MeV for different values of 
the inter-atomic bond-length d (dn = bulk NN distance). Different structures are 
considered for each cluster size: (a) triangle, (b) chain, (c) square, (d) rhombus, 
(e) trust, (f) square pyramid, (g) triangle, (h) square bipyramid, (i) hexagon, and 
(j) pentagonal bipyramid (see Ref. [33]). 



N Struct. 


AE{d/dB = 1.05) 


AE{d/dB = 1.00) 


AE{d/dB = 0.90) 


3(a) 


5.13 (1.43) 


5.30 (-1.01) 


-0.11 (-0.10) 


(b) 


2.01 


1.69 


1.25 


4(c) 


5.02 (-0.48) 


5.21 (-0.59) 


-1.03 (-0.11) 


(d) 


0.30 (-0.12) 


0.27 (-0.12) 


0.33 (0.31) 


5(e) 


0.29 (-1.01) 


0.28 (-0.82) 


-0.10 (-0.75) 


(f) 


1.09 (-0.01) 


0.88 (-0.03) 


-0.07 (0.04) 


6(g) 


-1.25 (-0.76) 


-1.12 (-0.62) 


0.33 (0.29) 


(h) 


4.66 (0.04) 


4.82 (-0.02) 


-0.07 (-0.23) 


7(i) 


1.78 ( 0.40) 


1.91 (0.09) 


2.22 (0.30) 


(j) 


4.32 (-0.03) 


4.52 (-0.02) 


-0.39 (0.00) 



Sect. 4.4.1). For constant values of {Sz), the MAE and orbital momentum 
{Ls) vary continuously, since the electronic spectrum and the local mag- 
netic moments are continuous functions of However, this is not the 

case when {Sz) changes, since a strong and discontinuous redistribution of 
the spin-polarized density takes place. Important changes in the energy- 
level structure occur around £p that modify the details of the SO mixing. 
The resulting changes in the electronic energy depend on the explicit form 
of Tiso and therefore on the direction of the magnetization (see Fig. 5). 
Consequently, very significant and discontinuous variations of the MAE are 
observed which may even lead to a change of sign of the MAE. Notice, that 
the rhombohedral Fe 4 cluster presents a remarkable in-plane anisotropy 
{Ex — Ey) ~ 0.4 eV. This indicates that uniaxial anisotropy models are 
not directly applicable to clusters that have strongly reduced symmetry. A 
similar situation is found for other clusters and band-fillings [33]. 

In Table 1 results are given for the MAE of Fe clusters with different 
structures and interatomic distances [33] . The following general trends are 
obtained: 

i) the MAE is much larger in small clusters than in the corresponding 
crystalline solids. In fact, the anisotropy energy AE is often even 
larger than in thin films. For instance, AE ~ 4—5 MeV is a typical 
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value for clusters (see Tab. 1). This is in agreement with experiments 
on free clusters and supported Fe nanoparticles [29]; 

a) AE depends much more sensitively than the spin moments on the 
geometrical structure of the cluster. Indeed, changes of sign in AE 
are found as a function of the interatomic distance d even in situations 
where the magnetic moments are saturated and consequently do not 
depend on the cluster structure; 

Hi) the in-plane MAE’s are considerably important in general. In some 
cases they are even larger than the usually considered off-plane 
anisotropy. The in-plane MAE is of course largest for low-symmetry 
structures and decreases, though not monotonically, as the angle be- 
tween non-equivalent x and y directions decreases. Experiments sup- 
port these conclusions. 

4.5.3 Enhancement of orbital magnetism 

In atoms, Hund’s rules predict maximum orbital angular moment L compat- 
ible with maximum spin multiplicity. In transition-metal (TM) solids, elec- 
tron delocalization and band formation result in the almost complete disap- 
pearance or quenching of (L) . Such intrinsic differences between atomic and 
bulk behaviors are characteristic of systems developing itinerant-electron 
magnetism. Consequently, the orbital magnetism in TM clusters is expected 
to show novel size-dependent effects that are important both from a fun- 
damental standpoint, and in view of applications of cluster-based magnetic 
materials. 

The previous general considerations indicate that (L) should be very 
sensitive to the local atomic environment. In fact, as already discussed in 
Section 3, recent experiments show that the orbital moments at TM sur- 
faces, thin films, and nanostructures are typically 100-200% larger than in 
the corresponding solids [49,50]. Moreover, a size-dependent enhancement 
of (L) should have direct consequences on the results for the average mag- 
netic moments per atom and on the comparison with experiment [24, 26] . It 
is therefore important to investigate the dependence of orbital magnetism 
on variables such as size, structure, local atomic environment, and d-band 
filling. 

In order to analyze the crossover from atomic L to bulk-like quenching 
of (L) with increasing size we focus on Nijv clusters. Besides its experimen- 
tal relevance, Ni is a very interesting system since the differences between 
atomic and bulk orbital moments are dramatic {L = = 2/ie in the 

atomic configuration, and {L) = Lb = 0.05/tb in the solid). More- 
over, the spin moments in Ni^v are the smallest among the ferromagnetic 3d 
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Fig. 6. Average orbital magnetic moment per atom {Ls) = | £c 5 (*)|/N of 

Niat clusters. Results are given for fcc-like (dots) and icosahedral-like structures 
(crosses). The magnetization direction <5 is taken along a principal C„ symmetry 
axis (5 = 2 , full lines) or along a NN bond perpendicular to z {5 = x, dashed 
lines). Full (open) triangles refer to coin- like bilayer clusters with perpendicular 
(in-plane) magnetization. Results for small clusters are given in the inset. The 
lines are a guide for the eye [52]. 



TM’s, and therefore the contribution of (L) to the total magnetic moment 
(M) = 2(5') -I- (Z) should be particularly significant. 

In Figure 6 results are given for the average orbital moment per atom 

N 

{Ls) = \Y,Ls{z)\/N 

i=l 

of Mat clusters having N < 169, and face-centered-cubic (fee) or 

icosahedral-like geometries. S = x,y, and z refers to different orientations 
on the magnetization. On the one side, we observe that the reduction of 
system size causes a remarkable enhancement of (Ls) with respect to the 
bulk. Values about eight times larger than Lb = 0.05/tb are not uncommon. 
On the other side, comparison with the atomic result La = 2 ^b shows that 
the largest part of the quenching of L takes place already at the smallest 
clusters, as soon as full rotational symmetry is lost. For example, for Nis 
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(triangle) we obtain (L^) = 0.40^b, and for M4 (tetrahedron) (L^) = 
0.30/iB- Concerning the size dependence, one observes that {Lg) decreases 
with increasing N showing some oscillations as the bulk limit is approached 
(see Fig. 6). Notice that an important enhancement of {Ls), about 100%, 
is still present even for the largest considered sizes {{Ls)/L\, = 1.8 for N = 
165). In small clusters, {Ls)/Lb ranges from {Ls)/Lb ~ 3 for IV = 50—80, 
to {Ls)/L\^ ~ 6 for IV = 10—20. Large (L^) are also found for N < 10, 
as shown in the inset figure. The results correspond here to the optimized 
cluster geometries determined in the framework of the local spin density 
approximation [51]. One concludes that orbital magnetism is the source of 
an important contribution to the size-dependent magnetic properties of Ni 
clusters. 

In Figure 6 results are also given for icosahedral clusters. In this case 
{Ls) decreases with increasing N as for the fee geometries but significant 
quantitative differences are observed. The five-fold symmetric structures 
yield in general larger {Lg) than the cubic symmetric ones. Strong struc- 
tural dependences are also found in smaller clusters. Notice that the results 
for N > 13 given in Figure 6 concern mainly highly-symmetric structures 
with nearly spherical shape and closed nearest neighbors (NN) shells. Clus- 
ters with lower symmetry usually present a richer size dependence and larger 
orbital moments, particularly when the number of surface atoms is larger. 
This could affect the convergence to bulk- like quenching for large N . Re- 
markable structural effects are expected for clusters deposited on surfaces, 
whose shape can be experimentally tuned to some extent by changing the 
landing velocities [50]. In order to illustrate this effect we present results 
for two-layer-thick coin-like Nijv, as indicated by the triangles in Figure 6 
{N = 144 and 158). In these two-dimensional islands, {Lg) is much larger 
than for three-dimensional fee clusters of comparable size. The enhance- 
ment with respect to the bulk is here about 200%. Similar trends are found 
for other TM’s, in qualitative agreement with recent experiments on de- 
posited Fe particles [50]. Structure and shape have a strong influence on 
the orbital moments of TM nanoparticles. 

The enhancement of {Lg) can be related to the changes in the local en- 
vironment of the atoms. First, the reduction of local coordination number 
with decreasing N causes a reduction of the effective d-band width and an 
decrease of the average energy-level spacings. Consequently, the local spin 
polarizations {Sz{i)) are increased, which induces larger orbital moments by 
means of the spin-orbit interactions. Second, orbital polarization (OP) con- 
tributions resulting from the orbital dependence of the Coulomb integrals 
Umm' and Jmm' amplify this effect {Umm' — U — mm! B with B ~ 0.1 eV). 
This amounts typically to 30—50% of the enhancement of {Lg). Third, the 
reduction of the effective band-width and the presence of degeneracies in 
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the single-particle spectrum favor a more effective spin-orbit mixing, which 
enhances {Ls{i)) even in situations where the spin polarizations are satu- 
rated {2{Sz) ~ 10 — Ud)- Notice that the changes in (S'z) and {Lg) involve 
different energy scales. Although the exchange energies dominate over SO 
and OP energies (J ~ 1 eV, i? ~ ^ ~ 0.1 eV), the occupations of the differ- 
ent m orbitals remain very sensitive to SO and OP interactions, since they 
may vary without altering the spin polarizations. Compare, for example, 
icosahedral and fee clusters in Figure 6. Finally, there are less predictable 
effects related to the details of the electronic structure and its dependence 
on cluster geometry {e.g., the presence of high-symmetry axes, changes in 
bond-length, etc.) which may affect the orbital moments depending on the 
d-band filling. A similar analysis holds for transition-metal surfaces and 
thin films [53,54]. 

The environment dependence of the local orbital moments Ls{i) pro- 
vides further insight on the enhancement of {Ls) in small clusters, and on 
the development of bulk-like quenching for large N . One observes that Ls(i) 
generally increases, with some oscillations, as we move from the center to 
the surface of the cluster [52]. Thus, the enhancement of {Ls) is driven by 
the cluster surface, as it is the case for the spin moments. For small N 
{N < 55) the enhancement of Ls{i) concerns practically all atoms, includ- 
ing those with a complete NN shell, well below the surface. For larger sizes 
{N > 135) bulk-like quenching starts to set in at the interior of the clus- 
ter, leaving significantly enhanced orbital moments only at a few outermost 
shells. As expected, one approaches the behavior found for TM surfaces, 
where enhanced Ls{i) are restricted to a few uppermost layers [54]. A 
droplet model, i.e., {{Ls) — Ls) oc is in fact a good first approxima- 

tion for N > 150. Notice, however, that the convergence to Lb is rather 
slow, even for N ~ 200. The fact that Ls{i) — Lb for inner atoms indicates 
that cubic point-group symmetry - which is absent in all but the central 
atom - is not essential for reaching almost bulk-like quenching. Instead, a 
local picture, namely, the recovery of bulk-like local atomic environment, is 
a more appropriate interpretation. 

Finally, let us point out that the contribution of the orbital moments - 
that in Ni align parallel to the spin moments - increases the value of the 
total magnetic moment (L) -|- 2{S) predicted by theory, thereby improving 
the agreement with experiment. In fact, despite some quantitative differ- 
ences among the results obtained by different groups, available spin-only 
calculations yield ground-state spin magnetizations that underestimate sys- 
tematically the experimental Jin by about 0.3— 0.6 ^b for < 13 [51]. The 
largest part of this discrepancy is removed by including the enhanced orbital 
contributions. 
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5 Electron-correlation effects on cluster magnetism 

A deeper understanding of the electronic correlations underlying the mag- 
netic properties of clusters is not only important from a fundamental point 
of view, but it also crucial in investigations of more delicate properties such 
as excitation spectra, magnetic behavior at finite temperatures or even the 
geometry of magnetic clusters. It is therefore necessary to go beyond sim- 
ple mean-field approximations by including charge and spin fluctuations 
explicitly. The simultaneous determination of electron-correlation effects, 
magnetism and cluster geometry is a very difficult task. In fact, most the- 
oretical studies performed so far have attempted to deal with only part 
of these aspects of the problem at a time. The localized character of the 
atomic-like 3d orbitals, and the complicated dependence of the magnetic 
moments and magnetic order on cluster geometry, have prevented the de- 
velopment of rigorous first-principles treatments of electron correlations. 

5.1 The Hubbard model 

The mean-field approximation discussed in Section 4.1 could be improved 
systematically, for example by treating the residual interactions by pertur- 
bation methods or by using Gutzwiller or Jastrow variational Ansdtze [55, 
56]. However, a general implementation of such calculations for TM clus- 
ters would be very demanding, particularly if the structure is arbitrary, 
i.e., lacking of any symmetry. An alternative approach which has provided 
numerous significant results is to consider a simpler electronic model which 
allows an exact or at least very accurate solution of the many-body problem 
and which at the same time contains enough complexity to be able to shed 
light on the physics of real systems. With this in mind, we simplify the 
model given by equations (4.5) and (4.6) by droping the d-band degeneracy. 
One obtains then the single-band Hubbard Hamiltonian [57] 

H = -t 4aCj<r + U (5.1) 

where {cia) refers to the creation (annihilation) operator for an electron 
at site i with spin a, and fiia = c\^Cia to the corresponding number opera- 
tor. The first term is the kinetic-energy operator, which describes electronic 
hoppings between nearest neighbor (NN) sites i and j leading to electron 
delocalization and bond formation (t > 0). The second term takes into 
account the intra-atomic Goulomb repulsion, which is the dominant contri- 
bution from the electron-electron interaction {U > 0) [57]. Equation (5.1) 
can be considered as a minimum model for describing itinerant electrons 
in a lattice. At low energies the electronic properties result from a del- 
icate balance between the tendency to delocalize the valence electrons in 
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order to reduce their kinetic energy, and the effect of the Coulomb repul- 
sions associated to local charge fluctuations. The relative importance of 
these contributions depends strongly on the electron density or number of 
electrons on the total spin S and on the ratio U /t. 

The Hubbard model for small clusters is solved numerically by expand- 
ing its eigenfunctions Idr;) = Oiim\^m) in a complete set of basis states 







\vac) which have definite occupation numbers n™ at 



all orbitals ia {fna\^m) = n^\^rn) with n™ = 0 or 1). The values of 
satisfy the conservation of the number of electrons v = and of the z 

component of the total spin Sz = where Taking 

into account all possible electronic configurations may imply a considerable 
numerical effort which in practice sets a drastic limit to the size of the 
clusters under study. For example, at half-band filling and minimal Sz the 
dimension of the Hilbert space D = is D = 853 776, 2 944 656 and 

11778 624, for N = 12, 13 and 14 respectively. For not too large clusters, 
the expansion coefficients aim corresponding to the ground state (I = 0) 
and low-lying excited states are determined by sparse-matrix diagonaliza- 
tion procedures such as the Lanczos iterative method [58]. Several cluster 
properties can be calculated in terms of the coefficients aim of the eigen- 
state |4'/) by simple operations on the basis states [4)^), for example, the 
spin-density distribution 



m 



the density-density correlation functions 

m 



(5.2) 



(5.3) 



and the spin correlation functions 

= i E - OK"t - <i)- (5-4) 

m 

Within a controlled accuracy, the results are thus exact in the framework 
of the Hubbard model. 

5.2 Geometry optimization in graph space 

The Hubbard model considers a single s-like orbital per site and NN hop- 
pings. The hopping integrals Uj between sites i and j take only two possible 
values, namely, tij = —t if Rij = Rq, and Uj = 0 if Rij > i?o, where Rij 
refers to the interatomic distance and Rq to the NN distance. Consequently, 
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only the topological aspect of the structure is relevant for the electronic 
properties. In other words, defining the cluster structure is equivalent to 
defining for each atom i those atoms j which are connected to z by a hopping 
element t. This implies that the set of all possible non-equivalent cluster 
structures is a subset of the set of graphs with N vertices [59] . For example, 
for N = 3 there are two different graphs (triangle and linear chain) and for 
TV = 4 these are six (tetrahedron, rhombus, square, star, etc.). Taking into 
account only NN hoppings with fixed bond length results in a discretization 
of the configurational space so that the geometry optimization can be per- 
formed within graph space. Notice, however, that the number of graphs rig 
or site configurations to be considered increases extremely rapidly with TV. 

For the study of clusters one must consider only those graphs which can 
be represented as a true structure in real space. A graph is acceptable as 
a cluster structure if a set of atomic coordinates Ri (i = 1, , TV) exists, 
such that the interatomic distances Rij satisfy the conditions Rij = Rq if 
the sites i and j are connected in the graph (z.e., if Uj = —t) and Rij > Rq 
otherwise (z.e., if tij = 0). While for TV < 4 all graphs are possible cluster 
structures, this is no longer true for TV > 5. For example, for TV = 5 it is 
not possible to have all atoms being NN’s from each other. With increasing 
cluster size it becomes increasingly difficult to identify the graphs which are 
not representable as a structure in TD < 3 dimensions. This problem can be 
solved with a computer algorithm based on the optimization of objective 
functions under different boundary conditions [16]. 



5.3 Ground-state structure and total spin 

The results for the most stable structure and the corresponding total spin 
S' as a function of U/t and v are summarized in the form of “phase” or 
structural diagrams as the ones shown in Figure 7. The main conclusions 
and trends derived from the calculations for TV < 8 are summarized below. 

For low electron or hole concentration (z.e., v/N < 0.4— 0.6 and 2 — 
v/N < 0.3-0. 6) the optimal cluster structure is independent of U/t, z.e., the 
structure which yields the minimal kinetic energy (uncorrelated limit) re- 
mains the most stable one, irrespectively of the strength of the Coulomb in- 
teractions. Furthermore, no magnetic transitions are observed: the ground- 
state is always a singlet or a doublet (see Fig. 7). This indicates that, for 
low carrier concentration, the Coulomb interactions are very efficiently sup- 
pressed by the correlations, so that the magnetic and geometric structure 
of the clusters are dominated by the kinetic term. This physically plausi- 
ble conclusion is further supported by analytical results for the case of two 
carriers. Nonetheless, the fact that this holds for finite values oi v/N and 
U/t ^ -koo, seems not obvious a priori. 
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2 3 4 5 6 7 8 9 ' 10 ' 11 ' 12 

V 



Fig. 7. Structural diagram of Hubbard clusters having N = 7 sites. The ground- 
state structures corresponding to a Coulomb repulsion U, hopping integral t and 
number of electrons v are illustrated. The ground-state spin S is minimal (i.e., 
S' = 0 or S = 1/2) unless indicated. 



For small we obtain compact structures having maximal average coor- 
dination number z {tij = —t < 0). These are all substructures of the icosahe- 
dron and have the largest possible number of triangular loops. 
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In contrast, for large v (small = 2N — v) open structures are found. 
In particular for = 2 we obtain bipartite structures, which have the 
largest possible number of square loops. This can be qualitatively under- 
stood in terms of the single-particle spectrum. In the first case (small v) the 
largest stability is obtained for the largest band-width for bonding (negative- 
energy) states (£b < —zt), while in the second case (small i^h) it is obtained 
for the largest band-width for antibonding (positive-energy) states, i.e., for 
the most compact bipartite structure. 

A much more interesting interplay between electronic correlations, mag- 
netism and cluster structure is observed around half-band filling (z. e., \iy/N— 
1| < 0.2-0. 4). Here, several structural transitions are found as a function of 
U/t (see Fig. 7). Starting from the ground-state structures for [7 = 0, one 
observes that as U is increased, first one or more of the weakest cluster bonds 
are broken. This change to more open structures occurs for C//t ~ 1—4 and 
is most often seen for v < N , since in this case the U = 0 structures are more 
compact, while for v > N the structures are rather open already for U 0 
(see Fig. 7). As U is further increased {U/t > 5—6) it is energetically 
more advantageous to create additional new bonds. Higher coordination 
gives the strongly correlated electrons more possibilities for performing a 
mutually avoiding motion that lowers the kinetic energy. Moreover, these 
compact structures are in general more symmetric and the electron-density 
distribution is more uniform, which also contributes to lower the Coulomb- 
repulsion energy. 

The structural changes at larger U are often accompanied by strong 
changes in the magnetic behavior. One may indeed say that these structural 
changes are driven by magnetism [60]. For half-band filling {ir = N) the 
optimal structures show minimal total spin S and strong antiferromagnetic 
correlations. None of the structures having a (unsaturated) ferromagnetic 
ground-state [61] were found to be the most stable ones for any value of U/t. 
The optimal antiferromagnetic structures are non-bipartite. For instance, 
the rhombus is more stable than the square for N = v = 4 (see also Fig. 7). 
The bonds that are frustrated in a static picture of antiferromagnetism, 
yield an appreciable energy lowering when quantum fluctuations are taken 
into account. Therefore, Hubbard clusters with one electron per site and 
large U/t can be best seen as frustrated quantum antiferromagnets. 

For all studied cluster sizes {N < 8), the most stable structures for 
V = N + 1 show ferromagnetism for large U (typically U /t > 4—14). This 
is in agreement with Nagaoka’s theorem [62]. For the smaller clusters, i.e., 
N = 3, 4 and 6, this is the only case where the optimal structures are 
ferromagnetic. For N = 5 unsaturated ferromagnetism {S = 1) is also 
found for = 4, though at large values of U {U/t > 30). For larger clus- 
ters, ferromagnetism extends more and more throughout the {U /t) — {v /N) 
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phase diagram. Clusters with N = 7 {N = 8) are ferromagnetic for i' = 4, 
6, 8 and 10 (v = 9—12). The tendency towards ferromagnetism is much 
stronger above half-band filling than below. This is qualitatively in agree- 
ment with experiments on 3d-TM clusters. In fact one observes that the 
magnetic moments per atom /r in V and Cr clusters are very small if not zero 
{fi < 0.6— 0.8/iB) [22], while Fe and Co clusters show large magnetizations 
[12,24,25]. Finally, let us remark that the appearance of ferromagnetism is 
much less frequent than what one would expect from mean-field Hartree- 
Fock arguments (Stoner criterion). This reflects the importance of cor- 
relations in low-dimensional systems, as it will be discussed in the next 
section [60,63]. It should be nonetheless noted that the Hubbard model 
for clusters probably exaggerates the effects of quantum fluctuations, since 
it is one of the most extremely low-dimensional systems one can consider. 
Improvements on the model, either by including several bands or non-local 
interactions, should tend to weaken such strong fluctuation effects. 

The Hubbard model is in accordance with nature in a further interest- 
ing aspect. While the number of possible site configurations or graphs rig 
increases in an explosive way with the number of atoms N, the number of 
structures Uq which are optimal for some value of the parameters U/t and 
v/N remains a handful. For example, for N = 7 (8), Ug = 853 (11 117) 
while no = 18 (23). As the cluster size increases, some growth patterns 
start to dominate and the same or very similar structures cover larger and 
larger regions of the phase diagram (see Fig. 7). The situation tends to what 
one observes in the macroscopic limit (solid state) where - regardless of the 
infinite number of topologically different atomic arrangements - the equi- 
librium lattice structures of the elements are only a few (fee, hep, bcc, etc.). 



5.4 Comparison with non-col linear Hartree-Fock 

The unrestricted Hartree-Fock (UHF) approximation developed in 
Section 4.2 in the context of spd-band models can also be applied in an 
straightforward way to the single-band Hubbard model [35]. Comparison 
between mean-field and exact results allows to quantify the role of electron 
correlations on various magnetic and structural properties. 

For the Hubbard model the UHF Hamiltonian reads 

77uHF — f ^ U ^ ] (yPla , (^-^) 

{lm)<T 1(7 



from which a single-determinant approximation jUHF) to the ground state is 
obtained. pia,ia' = are the matrix elements of the density matrix, 
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where (...) = (UHF|...|UHF) implies self-consistency. The distribution of 
the electron density is given by 

(n/) = +P/i,4. (5.6) 

and the spin polarization {Si) = {{Sf , Sf , Sf)) by 

(Sf) = (P4.4 + P4,d) /2, 

(Sf) = -i (P4.4 “ P 4 . 4 ) /2. 

(^f) = (m.4-m.4)/2- (5.7) 

Notice that the local magnetic moments {Si) are collinear only if pia^ig = 0, 
Vh In Figure 8 the selfconsistent spin arrangements obtained in fee clusters 
with ly = N = 13 are shown for representative values of U/t. 

In Figure 9 UHF and exact results are given for the local moments 
pf = {Sf) and spin correlations {Si ■ Sm) (fcc-like 13-atom cluster). The 
UHF values for pf are quantitatively not far from the exact ones. The 
uncorrelated limit {U = 0) is reproduced and a fairly good agreement is 
also obtained in the large U/t regime. Main trends such as the larger p/f at 
the cluster surface for small U/t and the reduction of the difference between 
surface and inner moments for large U/t come out correctly. However, 
UHF underestimates the increase of pf for U/t < 3.7 and anticipates the 
tendency to localization with increasing U/t. This results in a much more 
rapid crossover from weak to strong interacting regimes and in significant 
inaccuracies for intermediate U/t. 

The quantitative discrepancies are more important in the case of spin 
correlation functions {Si-Sm) particularly for large U/t (Fig. 9b). Here 
we find that UHF underestimates the strength of AF spin correlations 711 
at the surface. This can be interpreted as a consequence of the formation 
of permanent local moments that blocks quantum spin fluctuations along 
the transversal directions. Still, in both UHF and exact calculations, the 
increase of AF correlations at the surface (increase of |7n|) is done at the 
expense of a decrease of the spin correlations with the central atom (decrease 
of |7oi|). AF correlations among the larger surface magnetic moments dom- 
inate, while the AF couplings between central and surface atoms tend to 
cancel out. The discrepancies between UHF and exact results for = 13 
show the limits of mean-field and give us an approximate idea of the cor- 
rections to be expected in correlated calculations on larger clusters. As 
expected, UHF yields better results for properties like the local moments, 
that are related to the spin density distribution, than for the correlation 
functions. Nevertheless, since the trends given by UHF are qualitatively 
correct, one can be reasonably confident on the validity of the conclusions 
derived using mean- field. 
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Fig. 8. Illustration of the UHF magnetic order in fcc-like clusters having A = 13 
atoms, = 13 electrons and representative values of U /t (Hubbard model), 
(a) U jt < 3.7: collinear spin arrangement; (b) 4.8 < U /t < 5.1: all local spin 
polarizations {Si) within the (111) plane; (c) U jt > 5.1: non-collinear spin ar- 
rangement in 3 dimensions. The arrows indicate here the projection of {S{) on to 
the (111) plane. 



The determination of the cluster structure is a central problem in cluster 
research in particular for magnetism since, as we have seen, structure and 
magnetic behavior are interrelated. Most calculations of cluster structures 
are based upon mean- field approximations. Therefore, it is very interest- 
ing to evaluate the role of electronic correlations on cluster structure. In 
order to explore this problem we determine the relative stability of a few 
representative cluster structures in the framework of the UHF approxima- 
tion and we compare the results with the exact solution [16]. Four different 
symmetries are considered: icosahedral, face centered cubic (fee), hexago- 
nal close packed (hep), and body centered cubic (bcc) clusters. The mag- 
netic and structural diagrams for TV = 13 are shown in Figures 10 and 11. 
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Fig. 9. UHF and exact results for an fcc-like 13-atom cluster {v = N = 13). 
(a) Local magnetic moments (Sf) at the central site {I = 1, dashed) and at the 
cluster surface {I = 2—13, solid), (b) Spin correlation functions {Si ■ Sm) between 
the central site and its first NN’s at the surface ( 701 , solid) and between NN’s at 
the surface shell ( 711 , dashed). Curves with (without) crosses correspond to exact 
(UHF) results. 



A qualitative description of the magnetic order obtained in the UHF cal- 
culations is indicated by the different shadings. One may distinguish three 
different collinear spin arrangements: non-magnetic solutions (NM), non- 
saturated or weak ferromagnetic solutions (WFM) and saturated ferromag- 
netic solutions (SFM). The NM states are paramagnetic with minimal total 
moment = 0 or 1/2. Concerning the noncollinear spin arrangements 
we distinguish two cases: noncollinear nonmagnetic states (NC) in which 
non-vanishing (eventually large) local moments /i/ sum up to an approxi- 
mately minimal total moment < 1? and noncollinear ferromagnetic states 
(NCFM) that show a net magnetization /tt > 1. The NC states include all 
sort of frustrated antiferromagnetic-like spin structures, for example, those 
illustrated in Figure 8. 

For small U/t {U/t < 10) the UHF results are in agreement with the 
exact calculations (compare Figs. 10 and 11). The icosahedral cluster 
yields the lowest energy for low carrier concentrations {v < &)■ As al- 
ready discussed, the largest coordination numbers are favored for small 
since the kinetic energy dominates stabilizing the structure with the largest 
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Fig. 10. Magnetic and structural diagram of clusters having A = 13 atoms as 
obtained by using the Hubbard model in the unrestricted Hartree-Fock (UHF) 
approximation. Four different types of structures are considered: icosahedral 
(ICOS), face centered cubic (FCC), hexagonal close packed (HCP), and body 
centered cubic (BCC). The corresponding UHF magnetic orders are indicated by 
different shading patterns. 



bandwidth [16]. As v is increased several structural transitions occur. For 
small U /t, both exact and UHF calculations present the same structural 
changes: from icosahedral to fee structure at v = 11, from fee to hep at 
V = 17, and from hep to bcc &t v = 20. At larger U/t {U/t > 12) the 
interplay between the kinetic and Coulomb energies introduces correlations 
that cannot be accounted for within UHF and that are important to the 
magnetic and structural properties. A main source of discrepancy is the 
too strong tendency of UHF to yield SFM ground states, particularly above 
half-band filling. Consequently, the optimal structure is missed rather fre- 
quently in the limit of large U/t. For example, for = 19 and U/t > 18, 
UHF predicts the fee structure with a SFM ground state, while in the exact 
calculation the icosahedral cluster with S' = 1/2 is the optimum. Similar 
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V 

Fig. 11. Exact calculated structural and magnetic diagram of Hubbard clusters 
having = 13 atoms as obtained by considering the same four structures as in 
Figure 10. The corresponding ground-state spin S is minimal (S' = 0 or 1/2) unless 
explicitly indicated. Broken lines separate regions having the same structure but 
different S. For v < 9 the icosahedron yields the lowest energy for all U/t, while 
for V > 21 the bcc structure is the most stable. 



drawbacks are seen for other band fillings such a,s v = 7, 10, 21, and 22 
(large U/t). Still, in the event that the true ground state does show strong 
ferromagnetism, UHF succeeds since the ground state is the superposition 
of single-particle states. Examples of this kind are = 12, 14 and 20 for 
large U/t. These are rather exceptions since the presence of a FM ground 
state in the exact solution is in general much less frequent than predicted 
by UHF. Around half-band filling UHF reproduces qualitatively well the 
transition from fee or hep to icosahedral structure with increasing U /t as 
well as nontrivial re-entrant effects {v = 13—18). However, UHF underesti- 
mates the value of U/t at which the structural changes occur (see Figs. 11 
and 10). 

Summarizing, UHF fails to reproduce the exact phase diagram in de- 
tail, particularly in some of the most interesting AF or weak FM regimes. 
Structural transition are sometimes missing, and in other cases changes of 
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structures appear artificially. Nevertheless, it is also fair to say that UHF 
yields a good account of the relative stability between the considered struc- 
tures, even well beyond the weak coupling limit (up to C//t ~ 16). It also 
explains the larger stability of ferromagnetism above half-band filling, al- 
though in the limit of strong interactions the validity of UHF often breaks 
down. 

Despite some quantitative discrepancies, the local magnetic moments 
and the spin correlation functions given by UHF are qualitatively in ac- 
cord with the exact results. This suggests that UHF with noncollinear spin 
arrangements is a valid starting point for working on larger clusters and 
more complex multi-band models. Notice that the breaking of spin sym- 
metry and the formation of a permanent spin-density-wave states are often 
artifacts of the mean- field approximation. While the broken spin-symmetry 
can be restored, as proposed for instance in [56], the success of such an ap- 
proach is likely to depend on the geometry of the cluster. In low-symmetry 
structures UHF tends to exaggerate the formation of inhomogeneous spin- 
density distributions, which may be far from the exact solution and which 
may be difficult to correct a posteriori. In such cases a Jastrow-like varia- 
tional Ansatz on a restricted Hartree-Fock state is often more appropriate. 
Finally, let us point out that the effects of charge fluctuations and correla- 
tions are probably more drastic in the single-band Hubbard model than in 
d-band Hamiltonians, more appropriate for the description of TM’s. 



6 Finite-temperature magnetic properties of clusters 

The temperature dependence of properties reflecting phase transitions in 
finite systems are intrinsically different from the critical phenomena of the 
solid state [64]. In a ferromagnet, for example, the divergencies in the spe- 
cific heat C'p(T) and magnetic susceptibility x(T) at the Curie temperature 
Tq disappear, since the magnetic fluctuations at long wave-lengths are sup- 
pressed by the finite size of the cluster. Instead, these properties present a 
peak at a characteristic temperature Tc{N) with a size dependent width. 
In addition, the peaks in Cp(T) and x(T) occur in general at somewhat dif- 
ferent T. Therefore, the “Curie” temperature Tc{N) derived from different 
properties usually differ to some extent. 

General trends in the size dependence of Tc{N) seem difficult to in- 
fer a priori. On the one side, taking into account the enhancement of 
the local magnetic moments /r(i) and of the d-level exchange splittings 
Aea;(i) = — eq, one could expect that Tc{N) should be larger in small 

clusters than in the bulk. However, on the other side, it should be energet- 
ically easier to disorder the local magnetic moments in a cluster {e.g., by 
flipping or canting /i(z)) since the local coordination numbers are smaller. 
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If the later effect dominates, Tc{N) should decrease with decreasing N. 
In addition, recent model calculations [16] indicate that structural changes 
or fluctuations are likely to affect the temperature dependence of the mag- 
netization. This is particularly relevant for systems like FeAr and RhAr which 
are remarkably structural dependent already at T = 0. In order to derive 
reliable conclusions concerning the size dependence of Tc{N), the theory 
must take into account both the fluctuations of the magnetic moments and 
the itinerant character of the d electron states. Simple spin Hamiltonians, 
for example based on the Heisenberg or Ising model, are not expected to 
be very predictive, at least until they incorporate the electronic effects re- 
sponsible for the size dependence of the interactions between the magnetic 
moments. In fact, previous studies on TM surfaces have already shown that 
the effective exchange interactions Jij between NN moments /i(z) and /r(j) 
depend strongly on the local coordination numbers of sites i and j [65] . A 
similar behavior is also found in clusters, as it will be discussed below. 

6.1 Spin-fluctuation theory of cluster magnetism 

A complete understanding of the finite-temperature properties of magnetic 
TM’s remains an open problem even in periodic solids. Stoner’s mean-field 
theory gives a qualitatively correct description of the electronic structure 
and magnetization at T = 0 but fails completely at finite T. Considerable 
progress has been made in solids by using d-band model Hamiltonians and 
functional integral theory [6,7,66]. This formalism incorporates both the 
itinerant character of the d states and the local spin excitations (spin fluc- 
tuations) relevant at finite temperatures. The purpose of this section is to 
extend this theory to finite clusters. 

Consider the tight-binding Hubbard Hamiltonian for d electrons given by 
equations (4.5) and (4.6). For calculating the canonical partition function Z 
it is useful to rewrite the interaction term H as a sum of squared operators. 
Among several possibilities we choose, as proposed by Hasegawa [67], the 
local electron number operator Ni = + ^iai) and the local z- 

component of the spin operator Siz = ~ ^ial) at cluster site 

z, which retain explicitly the main physical variables of the problem. The 
transversal exchange terms are neglected for simplicity. In this way one 
obtains 



^ = ( 6 . 1 ) 
i ^ ' 

where U = (C/|j^ -I- C/||)/2 refers to the average direct Coulomb interac- 
tion between d-electrons and J = U^i — to the exchange integral (see 
Sect. 4.1). The Stratonovich-Hubbard identity can then be used to express 
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Z as a functional integral [68]. Neglecting irrelevant multiplicative factors, 
one obtains 




dj 7 i(t)d^i(t) exp 



where 






X Tr 



T exp < — 






it) 



H'{t) = H^t) + Uj2^Vj{t)m) - jJ2mSjAt)- (6.2) 

3 3 

The integration variables ^i{t) and rji{t) are interpreted as time-dependent 
exchange and charge fields at cluster site i, since they are coupled to the 
spin operator Siz and charge operator Ni, respectively. Z is given by a 
functional integral over the fields and rjiit) of the partition function 
Z' of noninteracting electrons moving under the action of these arbitrary 
time-dependent fields [^ = (^i, ...,^Ar) and fj = {rji, ...,7]^)]. The operator T 
ensures the conventional time ordering [69] . One may attempt to take into 
account the time dependence systematically by developing ^i{t) and r]i(t) 
in its Fourier components. The first term of such an expansion yields the 
so called static approximation which neglects the time dependences. The 
functional integral reduces then to an ordinary one: 

Z j djydC (6-3) 



where 



Z'i^,^) = exp{-/3F'(f,77)} 



Tr 



exp |— /? {h' — I 



(6.4) 



(6.5) 



The static approximation is exact in the highly correlated limit {t'^^ = 0, 
Vz yf j) where no charge fluctuations are present, and in the non-interacting 
limit {Uaa' = 0). For nontrivial cases it describes the dynamics of the 
electrons as independent particles moving in a random alloy with energy 
levels 



-k Uiri, 




( 6 . 6 ) 
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The thermodynamic properties of the system are given by a statistical aver- 
age over all possible distributions of throughout the cluster. For T ^ 0 
the field configuration ffl that minimizes the free energy domi- 

nates. This corresponds to the self-consistent equations 



dF' 
dii 0 
dF' 

diji Q 



^(e?-2(^i,))=0, 

- (N,)) = 0, 



(6.7) 

( 6 . 8 ) 



where (...) indicates ground-state average. Note that equations (6.7) 
and (6.8) are the usual Hartree-Fock equations (4.7) and (4.8) that were 
used in Section 4.4 for calculating the magnetic properties of clusters and 
nanostructures at T = 0. The static approximation is thus a suitable ex- 
tension of Hartree-Fock theory that includes spin fluctuations at finite tem- 
peratures. Notice that at T > 0 the fields ^i, rji fluctuate around 77° and 
other possible local minima. 

If one is mainly interested in the magnetic properties it is reasonable to 
neglect the thermal fluctuations of the charge fields rji. For each exchange 
field configuration we set rji where rj is the saddle point of F’(^, ff) 

given by 



dF' 



d'ni 



Vi 



-iu{if], - m) = 0 . 



(6.9) 



Physically, this means that the charge distribution (Ni) is calculated self- 
consistently for each Since the rji are implicit functions of 



Z^J d^Z'iO = J d^expi-PF'ii)} 



(6.10) 



depends only on the relevant exchange variables . The integrand in equa- 
tion (6.10) can be interpreted as proportional to the probability P{^) for 
a given exchange field configuration All thermodynamic quantities are 
given by an average over all possible ^ with exp{— /3A'(^)} as weighting 
factor. For example, the local magnetic moments are obtained from 

M.(r) = ^l dC2(^i,)e-^^'(«) = ly (6.11) 

This equation justifies the intuitive association between the fluctuations of 
the local moments 2{Siz) and those of the exchange field ^i. The cluster 
magnetization per atom is given by the average of the local magnetizations 
M*(T). 
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In order to discuss the physical behavior of the free energy F'{^) from 
which the magnetic properties are derived, and in particular the spin exci- 
tations for different cluster sites i, we consider the probability distribution 
function Pi{^) of having ^ at atom i, which is given by 



= I / n exp{-/3i^'(a , . . . , , e, ii+i, 

= ;^exp{-/3i^'(^)} • 



( 6 . 12 ) 

(6.13) 



For very low temperatures the integration in equation (6.13) may be simpli- 
fied by setting for I ^ i (see Eqs. (6.7) and (6.8)). The free-energy 

difference AF/(^) = F/(^) — F'(T = 0) obtained in this way, 



ae'(C) 



= F'ifi, ■ . • - F'ie „ . . . 



IT^O 



a), 

(6.14) 



represents the energy involved in an exchange-field fluctuation ^ above the 
Hartree-Fock ground state. 



6.2 Environment dependence of spin fluctuation energies 

We consider the low-temperature limit of the local free-energy F/(^) and 
compare the behavior of AF'(^) for different atoms i within the ferromag- 
netic cluster. This provides useful information on the stability of the lo- 
cal magnetizations and its environment dependence. In Figures 12 and 13 
results are given for AF)'(^) in Feg and Feis clusters with bcc-like struc- 
ture. First of all notice that AF'{^) = F'{^) — Fl(^ = p°) > 0 which 
indicates, as expected, that the ferromagnetic order is stable at low tem- 
peratures. The surface atoms of Feg, which have the largest local mag- 
netic moments /i° at T = 0, show two minima located at and 

~ —Pi- This double-minimum structure in F'(^) indicates that flips 
of the surface magnetic moments keeping their amplitude approximately 
constant are the dominant magnetic excitations. In contrast at the central 
atom, which has a much smaller p^, F'(^) has a single minimum. Here the 
amplitude fluctuations of ^ dominate. At the surface of the cluster only 
small ffuctuations of ^ are possible with AF/(^) smaller than the energy 
AFl{^~) = F'{^~) — F'{^~^) required to flip a local magnetic moment. The 
probability Pi{^) oc exp{— (3AFl{^)} has 2 sharp maxima at ^ ~ and 
^ with Pi{^~) Pi{^ = 0). The fact that very small clusters and 

particularly cluster atoms having small local coordination numbers Zi show 
such a Heisenberg- or Ising-like behavior is not surprising, since the kinetic- 
energy loss caused by dipping a local magnetic moment pi ^ ^ ~Ei] 
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Fig. 12. Local spin-fluctuation energy AF/(^) = F/(^) — F/(T = 0) as a function 
the exchange field ^ at different atomic sites i of Feg with bcc-like structure. Dots 
refer to the central atom and crosses to one of the eight surface atoms (NN of 
the central one). Results are given for d/de = 1.0 (full curves) and d/ds = 0.92 
(dashed curves), where d refers to the NN distance (ds = bulk NN distance). 



is smaller when zi is smaller (Sk oc y^). At the same time, the exchange 
energy Aifa, = (J/4)X)i/tf, being a local property, is much less affected 
by the change of sign of Even Ni, that in the solid state has a single 
minimum in F' (^) and is therefore dominated by amplitude fluctuations of 
^ [66], tends to show a double minimum structure in F'(^) for sufficiently 
small N. These results suggest the existence of a transition or crossover 
from Heisenberg-like to itinerant-like behavior with increasing cluster size. 
Physically this can be interpreted as a consequence of the competition be- 
tween the Coulomb interaction energy, which is relatively more important 
in small clusters, and the kinetic d-band energy, which is most important 
in the bulk. 

It is also interesting to determine how AF'(^) depends on the inter- 
atomic distances in order to infer the effects of structural distortions on the 
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Fig. 13. Local spin-fluctuation energy AF/(^) = F/(^) — F/(T = 0) as a function 
the exchange field ^ at different atomic sites i of Feis with bcc-like structure. 
Dots refer to the central atom, crosses to the first NN shell, and open circles to 
the outermost shell. Results are given for (a) d/db = 1.0 in full curves and (b) 
d/db = 0.96 in dashed curves, where d refers to the NN distance (db = bulk NN 
distance) . 



cluster magnetization curves. If the free-energy is optimized by changing 
the NN distances and by keeping the cluster symmetry unchanged (uniform 
relaxation) usually a bond-length contraction is obtained {d < ds) [10]. 
Therefore, we also show in Figure 12 results for Feg using d/de = 0.92. In 
this case one finds no qualitative, but strong quantitative changes in F'{^) 
as a function of Besides the shift of the position of the minimum at 
reflecting the reduction of the local magnetic moments at T = 0, 
one observes a remarkable reduction (about a factor 10) of the free energy 
AF'{^~) = F[ (^“) — Fl (^+) required to flip a local magnetic moment at the 
surface atoms. A similar large reduction of the cluster “Curie” temperature 
Tc{N) is expected to occur upon relaxation, since in first approximation 
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Tc{N) oc AF/(^“). These results show once more the strong sensitiv- 
ity of the magnetic properties of 3d TM clusters to changes in the local 
environment. As it will be discussed in the next section, recent calcula- 
tions including correlations effects exactly within the single-band Hubbard 
model have also revealed the importance of structural fluctuations for the 
temperature dependence of the magnetic properties of clusters. As in the 
unrelaxed case, depends strongly on the position of the atom i within 

the cluster. Similar effects are also observed for other sizes and structures, 
for example, for bcc-like Feis as shown in Figure 13. Comparing AF'(^~) 
at different atomic sites i one observes that AFl{^~) does not scale simply 
with the local coordination number Zi. Thus, the effective exchange cou- 
pling between local magnetic moments depends on the local environment, 
as was found at the surfaces of macroscopic TM’s [65]. 

The discussion of F'{^) as a function of a single exchange variable 
does not mean that the local magnetic moment on different atoms fluctuate 
independently of each other. Short-range magnetic correlations are known 
to be present in TM solids, surfaces and clusters even for T > Tc [70]. 
They can be important for determining the cluster magnetic properties 
since they introduce additional couplings between sites with different local 
environments. In the framework of functional-integral theory, short-range 
magnetic order results in correlations between the exchange fields at neigh- 
boring sites. For example, in the ferromagnetic case for 

i and j being nearest neighbors. Particularly interesting finite-size effects 
should occur when the cluster size becomes of the order of spin-spin correla- 
tion length. In this case, the entropy increase leading to a reduction of the 
ferromagnetic order necessarily requires breaking the energetically favorable 
short-range magnetic correlations. 



6.3 Role of electron correlations and structural fluctuations 

Stern-Gerlasch deflection experiments in free transition-metal (TM) clusters 
show that temperature dependence of /I at derived from experiment depends 
qualitatively on the considered TM [24]. The theoretical study of finite- 
temperature magnetic properties is a difficult problem due to the strong 
interdependence of electron correlations, magnetism and cluster structure. 
On the one side, as in solids, one requires an accurate treatment of electron 
correlations in order to determine the electronic excitations, particularly 
when spin fluctuations or changes in the total spin are involved. On the 
other side, in clusters one should take into account possible temperature- 
induced changes or fluctuations of the geometry on which the magnetic 
properties of itinerant electrons are known to depend strongly. 
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In order to investigate this problem we consider the Hubbard 
Hamiltonian 



H = -t Y. 



i 



(6.15) 



For small clusters the eigenstates of equation (6.15) can be calculated ex- 
actly by numerical methods [71]. The finite-temperature properties are then 
derived from the canonical partition function 

Z = (6-16) 

a I 

over electronic and structural degrees of freedom. In particular the average 
total spin (S') is obtained from 

( 6 . 17 ) 

9 I 

In equations (6.16) and (6.17) ei{g) stands for the Zth eigenenergy corre- 
sponding to a cluster structure or geometry g. (3 = l/fceT, where T is the 
temperature of the cluster source that defines the macroscopic thermal bath 
with which the small clusters are in equilibrium before expansion. Thermal 
average refers then to the ensemble of clusters in the beam. Keeping v and 
N fixed (canonical ensemble) corresponds to the experimental situation in 
charge and size selected beams [24] . In order to sample all relevant cluster 
geometries we recall that in the Hubbard model only the topological aspect 
of the structure is relevant to the electronic properties. Taking into ac- 
count only NN hoppings with fixed bond lengths results in a discretization 
of the configurational space so that the sampling of cluster geometries can 
be performed within graph space (see Sect. 5.2). 

In Figure 14 the temperature dependence of the local moments {Sf) and 
of the NN spin-correlation functions = {Si ■ Sj) are shown for clusters 
having N = 6 atoms and v = 7 electrons. Ujt = 64 is representative 
of the limit of strong electron correlations for which the ground state is 
ferromagnetic (FM) with saturated moment S = 5/2. For simplicity we 
first assume that the structure is independent of T. The considered clus- 
ter geometry, shown in the inset, is the most stable one at T = 0. The 
effects of structural fluctuations are discussed below. Figure 14 shows that 
> 0 for all T, which indicates FM order, and that decreases with in- 
creasing T vanishing in the high-temperature limit. The crossover from the 
low-temperature ferromagnetically-ordered state to the high-temperature 
disordered state is significantly broadened by the finite size of the cluster. 
However, a characteristic temperature scale Tc{N) may be identified above 
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Fig. 14. Temperature dependence of the local magnetic moments {Sf) and spin 
correlation functions 'ytj = {Si - Sj) between different NN sites i and j in a clnster 
having N = 6 atoms, v = 7 electrons, and U/t = 64. The results correspond to 
the most stable structure at T = 0 as illustrated in the inset. 



which the FM correlations are strongly reduced by thermal spin fluctua- 
tions. In contrast, the local moments (Sf) remain essentially unaffected 
well above Tq. Taking the inflection point in ^ij{T) as a measure of Tc{N) 
one obtains Tc/t ~ 0.1— 0.2. This corresponds to about 1/100 of the bulk 
band width or to 500—600 K if parameters appropriate for TM’s are 
used (1Tb = 16t for the fee structure and 1Tb = 5—6 eV for TM d-bands). 
Around this temperature the total average magnetization {S) is significantly 
reduced with respect to the low-temperature value S' = 5/2 showing also 
an inflection point (see Fig. 15 for U/t = 64). Furthermore, other physical 
properties also show remarkable changes at T ~ Tc = 0.15t. For example, 
the magnetic susceptibility x('T) has a Curie- Weiss like behavior of the form 
X (X 1/(T — Tc)~^ for T > Tq, and the specifle heat Cp(T) shows a peak at 
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T/t 



Fig. 15. Temperature dependence of the average total spin (S) of clusters having 
N = Q atoms and v = 7 electrons [{S){{S) + 1) = (<5^)]. The results are obtained 
by taking into account either the electronic excitations alone (dashed lines), or 
both electronic excitations and structural fluctuations (solid lines). 



r ~ Tc due to magnetic excitations. These results can be interpreted as a 
precursor of a magnetic phase transition in the infinite solid. 

{Sf) and jij(T) present a very rich environment dependence. The local 
moments decrease with increasing local coordination number Zi, as already 
observed in mean-field calculations. For some bonds {e.g., (i,j) = (5,3)) the 
spin correlations are largest at low T but then decrease quite rapidly with 
increasing T. This corresponds to bonds with the smallest local coordination 
numbers. On other bonds with larger zi {e.g., {i,j) = (6, 1)) the short-range 
magnetic order resists better the effects of temperature fluctuations, despite 
the fact that the local moments and the ground-state spin correlations are 
somewhat weaker. 

The temperature dependence of the average total magnetic moment {S) 
is shown in Figure 15 [{S){{S) -I- 1) = (5^)]. For large U/t (saturated FM 
ground state) {S) decreases monotonically with increasing T tending to a 
remnant value (S')oo > 0 at high temperatures {T > t). {S)oo corresponds 
to an equally probable occupation of all electronic states within the first 
Hubbard band, i.e., for a minimal number of double occupations (fceT <C 
U). In the present case {S)od = 1-5. The classical analogous of {S)od 
is the average of N random spins which does not vanish in a finite system 
((Moo) = Mo/V^)- The difference {S) — {S)od is a measure of the importance 
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of short-range magnetic order [70]. For smaller values of U jt corresponding 
to a non-saturated FM ground-state - for example, S = 3/2 for U /t = 
16 - a completely different, non-monotonous temperature dependence of 
{S) is found. Here, one observes first an increase of {S) with increasing 
T, for T < 0.05t, followed by a decrease towards the high-temperature 
limit (S')oo- The increase of (S) at low T results from populating low- 
lying excited states which have higher S than the ground state [if(5' = 
5/2) — E{S = 3/2) = 0.02t for U/t = 16]. If the temperature is further 
increased, {S) decreases since the FM correlations are destroyed in a similar 
way as for U/t = 64 (see Fig. 14). It is interesting to observe that a 
weak increase of the average magnetization per atom with increasing T 
has been experimentally observed in large Cojv clusters (TV ~ 50—600 and 
T ~ 400) [24]. The temperature dependence of {S) in non-saturated FM 
clusters is very sensitive to the value of U/t. If U/t is increased beyond 
U/t = 16, the S = 5/2 states are stabilized with respect to the quartet 
and the maximum in {S){T) shifts to lower temperatures. For sufficiently 
large U/t the sextet becomes the ground state. In contrast, at smaller 
U/t {e.g., U/t = 12) the excitation energies to high-spin states increase 
and the low-lying excitations correspond to minimal S. Consequently, {S) 
decreases rapidly at low T showing eventually a minimum at intermediate 
T. The strong sensitivity of the spin excitation spectra on U/t reflects the 
importance of electron correlations to the finite temperature behavior. 

The solid curves in Figure 15 are obtained by including temperature- 
induced changes of structure (see Eqs. (6.16) and (6.17)). Structural fluc- 
tuations play no role at low temperatures where the ground-state structure 
dominates {T/t < 0.05). However, at higher T they contribute to a more 
rapid decrease of {S). In order to analyze this effect one may compute 
the temperature dependence of the average coordination number (z) . With 
increasing T, (z) decreases since breaking NN bonds becomes increasingly 
probable. The reduction of (z) is more important for U/t = 16 than for 
U/t = 64 since ferromagnetism tends to stabilize the more compact struc- 
tures [16]. Consequently, the effect of structural fluctuations on {S){T) is 
somewhat stronger for moderate U/t, where the ground-state magnetic mo- 
ments are not saturated. The reduction of {S) as a result of structural 
fluctuations may be qualitatively understood as follows. In open structures 
the band width for bonding (negative energy) states is smaller. Therefore, 
the eigenstates with maximal spin, whose energy is dominated by the mi- 
nority electrons, are comparatively less stable than lower spin states. In 
other words, the spin-flip energy AE = if(S'max — 1) — E{Smax) tends to 
decrease with decreasing coordination number. Thus, {S)(T) is reduced as 
more open structures are populated. Notice, however, that more complex 
structural effects may be expected for other v/N and U/t. For example. 
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one may find structures having a higher energy and a higher ground-state 
spin S than the optimal structure at T = 0 [16]. In such cases taking into 
account structural fluctuations could tend to increase {S){T). Summariz- 
ing, thermal-induced structural fluctuations play an important role in the 
temperature dependence of cluster magnetic properties. 

7 Conclusion 

The different problems discussed in this chapter reflect the remarkable ac- 
tivity in the field of cluster magnetism and suggest possible future studies. 
We have seen, for example, that the magnetic moments in free 3<i transition- 
metal clusters are enhanced with respect to the corresponding solids. This 
has been related to the reduction of the local coordination number and to 
the resulting changes in the electronic structure. In contrast, when the same 
clusters are embedded in a non-magnetic metallic matrix their magnetic mo- 
ments are smaller than the bulk magnetization per atom, a consequence of 
strong cluster-matrix hybridizations. In the case of magnetic clusters de- 
posited on surfaces the interplay between reduction of local coordination 
number and cluster-substrate hybridization effects should reveal a wide va- 
riety of interesting behaviors. The properties of clusters, islands and dots 
on surfaces as a function of size and structure motivate a considerable ex- 
perimental and theoretical activity, particularly in view of developing new 
magnetic materials for applications in high-density recording and memory 
devices. 

Electron-correlation phenomena and their temperature dependence are 
subjects of fundamental importance in cluster research where many open 
questions remain to be elucidated. Methodological improvements on both 
theoretical and experimental methods are worthwhile and necessary. For 
instance, a systematic theoretical study of the photoemission spectra of 
TM clusters would allow to relate the size dependence of excitation spectra 
and magnetic properties, thus providing valuable information that would 
complement the results of Stern-Gerlasch deflection experiments. The de- 
termination of electronic excitations motivates the development of an im- 
proved treatment of many-body effects. Rigorous calculations using realistic 
d-band Hamiltonians and exact diagonalization methods on small clusters 
could serve as a basis for testing approximate methods applicable to larger 
systems. The analysis of the evolution of the many-body electronic structure 
as a function of size would also improve our understanding of fundamental 
properties such as the spin and orbital magnetic moments, the magnetic 
anisotropy and the transition from localized to itinerant magnetism. 

The enhancement of the local magnetic moments in low-dimensional sys- 
tems and the presence of significant short-range magnetic order above the 
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Curie temperature Tq{N) are two characteristics of TM magnetism which 
play a significant role for determining the finite temperature behavior. In 
the case of clusters, changes or fluctuations of structure can also be very 
important. The spin- fluctuation theory developed in this chapter is an ap- 
propriate means of addressing the main issues of this problem, since the 
local fluctuations of the magnetic moments and the itinerant character of 
the d-electron states are taken into account at the same electronic level. 
Presently, more systematic studies of finite-temperature properties are nec- 
essary, in particular concerning the size dependence of the cluster Curie tem- 
perature Tc{N). Monte-Carlo algorithms could be applied to perform the 
functional integration over exchange fields, including contributions beyond 
the usual single-site approximations. For large clusters and cluster-based 
nanostructures simpler spin models could be formulated by deriving effec- 
tive spin-spin interactions from the electronic calculations. In this context 
the non-adiabatic relaxation dynamics of spin excitations deserves special 
attention. Small clusters would be also useful as test models for investigat- 
ing non-equilibrium magnetic processes such as the dynamics of recording 
within a microscopic theory. 

In conclusion, cluster magnetism offers a challenging research area that 
will continue to reveal a series of novel phenomena, fascinating from the 
fundamental standpoint and potentially important for technological appli- 
cations. 

It is a pleasure to thank the organizing committee and staff of the Les Houches Summer 
School of Theoretical Physiscs for their kind hospitality and support. Numerous fruitful 
discussions with Prof. J. Dorantes-Davila are most gratefully acknowledged. 
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AND FULLERENES 



A.V. Solov’yov 



Abstract 

This paper gives a survey of physical phenomena manifesting them- 
selves in electron scattering on atomic clusters. The main emphasis is 
made on electron scattering on fullerenes and metal clusters, however 
some results are applicable to other types of clusters as well. This 
work is addressed to theoretical aspects of electron-cluster scattering, 
however some experimental results are also discussed. It is demon- 
strated that the electron diffraction plays important role in the for- 
mation of both elastic and inelastic electron scattering cross sections. 

It is elucidated the essential role of the multipole surface and volume 
plasmon excitations in the formation of electron energy loss spectra 
on clusters (differential and total, above and below ionization poten- 
tial) as well as the total inelastic scattering cross sections. Particular 
attention is paid to the elucidation of the role of the polarization in- 
teraction in low energy electron-cluster collisions. This problem is 
considered for electron attachment to metallic clusters and the plas- 
mon enhanced photon emission. Finally, mechanisms of electron ex- 
citation widths formation and relaxation of electron excitations in 
metal clusters and fullerenes are discussed. 

1 Introduction 

Clusters have been recognized as new physical objects with their own prop- 
erties relatively recently. This became clear after such experimental suc- 
cesses as the discovery of electron shell structure in metal clusters [1,2], ob- 
servation of plasmon resonances in metal clusters [3-5] and fullerenes [6,7], 
formation of singly and doubly charged negative cluster ions [8] and many 
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more others. Complete review of the field can be found in review papers 
and books, see e.g. [9-14]. 

The novelty of cluster physics is greatly connected with the fact that 
cluster properties explain the transition from single atoms or molecules to 
the solid state. New possibilities in experiment made it possible to study this 
transition. Increasing cluster size, one can observe the appearance of the 
new physical features in the system, such as plasmon excitations, electron 
conduction band formation, superconductivity and super liquidity, phase 
transitions, fissioning and many more others. Most of these many-body 
phenomena exist in solid state and are absent in a single atom. 

Clusters can naturally exist in all forms of matter: solid state, liquid, 
gas and plasmas and often collision processes involving atomic clusters play 
an important role in understanding properties of various physical systems. 
For example, positive and negative cluster ions do exist in plasmas and 
understanding of mechanisms of their creation and decay is essential for the 
description plasma properties and its diagnostics. 

Properties of clusters can be studied by means of photon, electron and 
ion scattering. These methods are the traditional tools for probing prop- 
erties and internal structure of various physical objects. There are many 
different types of clusters, such as metallic clusters, fullerenes, molecular 
clusters, semiconductor clusters, organic clusters, quantum dots, positively 
and negatively charged clusters, which can be studied by these means. This 
makes the number of various physical situations to be examined quite large. 
Therefore when considering electron cluster collisions one needs primarily 
to focus on certain specific cases and try to recognize physical phenomena 
relevant for various types of clusters. 

Following this line we consider electron collisions with metal clusters 
and fullerenes, being in a gas phase, and focus on the following physical 
problems: manifestation of electron diffraction both in elastic and inelas- 
tic collisions [15-18], the role of surface and volume plasmon excitations in 
the formation of electron energy loss spectra (differential and total, above 
and below ionization potential) as well as the total inelastic scattering cross 
sections [15-19], importance of the polarization effects in electron attach- 
ment process and photon emission processes [20-30] . We also discuss briefly 
mechanisms of electron excitation width formation and relaxation of elec- 
tron excitations in metal clusters [31-33]. The choice of these problems is 
partially made, because of their links with experimental efforts performed 
in the field. 

The outlined problems can be tackled using various approaches. In this 
lecture solutions of the outlined problems are given on the basis of the con- 
sistent many-body theory developed with the Hartree-Fock jellium model 
wave functions. Many electron correlations in the system are taken into 
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account, where it is necessary, using the random phase approximation with 
exchange (RPAE) and the Dyson equation method. These approaches are 
very well known in atomic and nuclear physics. Their effective use for clus- 
ters is one of the great advantages of the jellium model, which we briefly 
discuss as a basis for the electron scattering theory on metal clusters and 
fullerens. 

In this paper the atomic system of units, = |e| = We = 1, is used. 



2 Jellium model: Cluster electron wave functions 



Metallic clusters are characterized by the property that their valence elec- 
trons are fully delocalized. To some extent this feature is also valid for 
fullerenes, where the delocalization of electrons take place on the surface 
in the vicinity of the fullerene’s cage. When considering electron collisions 
involving metal clusters and fullerens, often, namely the valence delocalized 
electrons play the most important role in the formation of the cross sections 
of various collision processes. Therefore, an adequate description of such 
processes is possible to achieve on the basis of the jellium model. 

Let us briefly discuss the main postulates and the results of the jellium 
model essential for our further consideration. Under different experimen- 
tal conditions, the detailed ionic structure has been found not to affect the 
properties of alkali and other simple metal clusters very much (see e.g. [9-11] 
for a review). This behavior suggests the validity of a jellium model, defined 
by a Hamiltonian which treats the electrons in the usual quantum mechan- 
ical way, but approximates the ionic density n(r) as a uniform positively 
charged background: n{r) = ^0(i? — r), where V = is the cluster 

volume, R = is the cluster radius, Vg is the Wigner-Seitz radius, 

N is the number of atoms in a cluster, Q{R — r) is the theta-function. 

The mean filed of ions in this approximation is equal to 



U{r) = 



~^,r>R. 



( 2 . 1 ) 



This model leads to a description of the electron density in terms of single 
electron wave functions that extend over the entire cluster. The impor- 
tant consequence of this approach is the electronic shell structure in free 
alkali clusters, which was described theoretically and observed experimen- 
tally in [1,2]. 

Initially, jellium calculations for metal clusters were based on the density 
functional formalism using pseudopotentials for description of the electron 
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relaxation effects and lattice structure [34]. Fully self-consistent calcula- 
tions for the spherical jellium metal clusters have been performed in the 
framework of the spin-density-functional formalism [35] and the Kohn- 
Sham formalism for self-consistent determination of the electron wave func- 
tions [2,36]. 

Later the spherical jellium model for metal clusters based on the 
Hartree-Fock scheme of the self-consistent determination of the electron 
wave functions has been developed [37-39] . In spite of the significant differ- 
ences in the theoretical justification of the many-body theory based on the 
use Hartree-Fock electron wave functions and the local density approxima- 
tion, the predictions of two theories are rather similar [40] . 

Shortly after the discovery of electronic shell structure in free alkali clus- 
ters [1,2] it was realized that the detailed fV-dependencies of ionization po- 
tentials and other characteristics of small metal clusters can be understood 
as a consequence of non-spherical cluster shapes [41] by analogy with the 
nuclear shell model. A direct evidence for cluster deformation was achieved 
in experiments on photoabsorption, where splitting of plasmon resonances 
caused by the cluster deformation had been observed (see [9-11] and refer- 
ences therein). 

Kohn-Sham calculations for spherical [2, 36] metal clusters have been 
generalized for spheroidal [42,43] and more general axial shapes of light clus- 
ters [44-50] . Light clusters of arbitrary shapes have been studied by means 
of the ultimate jellium model [45,46]. It has been shown that alkali-metal 
clusters have similar shapes with small atomic nuclei [51]. In [33] shapes 
of deformed axially symmetric metal clusters have been studied within the 
framework of the Hartree-Fock jellium model. 

Recently, the dynamic jellium model for metal clusters, which treats 
simultaneously the vibration modes of the ionic jellium background, the 
quantized electron motion and the interaction between the electronic and 
the ionic subsystems, has been suggested [31-33]. Quantized electron mo- 
tion has been described on the basis of the Hartree-Fock approach. This 
model allowed to describe the widths of electron excitations and relaxation 
times of electron excitations in metal clusters beyond the adiabatic approx- 
imation. 

Usually, the Hartee-Fock approximation serves as a basis for a system- 
atic development of many-body (z.e. many-electron) theory. In general, 
the Hartree-Fock method itself combined with such methods as the ran- 
dom phase approximation with exchange (RPAE) [52], the Dyson equation 
method [53] and the consistent many-body theory provides one of the most 
fundamental and powerful approaches for the description many body phe- 
nomena in atomic, molecular and cluster physics. In cluster physics such 
program has been fulfilled in full for spherically symmetric metal clusters 
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in a number of papers cited above. For deformed metal clusters analysis of 
many-electron correlations on the basis of the Hartree-Fock jellium model 
has not been performed so far and is a subject for further efforts. 

Finally let us stress that many-body theory developed with the Hartree- 
Fock jellium model wave functions allows one to construct models on the 
basis of fundamental physical principles, which can then be refined by ex- 
tending the quality of the approximations. Alternatively, one can construct 
pseudo-potentials, exchange-correlation functionals, etc., which of course do 
exist in principle as unique quantities but are not actually understood, so 
that they serve as a much less satisfactory basis for achieving a physical 
interpretation. 



3 Diffraction of fast electrons on clusters: Theory and experiment 

The jellium model of metal clusters and fullerenes can be examined in elec- 
tron elastic scattering of fast electrons on metal clusters and fullerenes. 
Indeed, the jellium model implies that there is a rigid border in the ionic 
density distribution of a cluster. The presence of a surface in a cluster re- 
sults in the specific oscillatory behaviour of the electron elastic scattering 
cross sections, which can be interpreted in terms of electron difraction of 
the cluster surface [15,18]. 

The detailed theoretical treatment of the diffraction phenomena aris- 
ing in electron scattering on metal clusters and fullerenes has been given 
in [15-17]. Experimentally, diffraction in electron elastic scattering cross 
sections on fullerenes in the gas phase has been observed for the first time 
in [18]. 

Let us explain the physical nature of the diffraction phenomena arising 
in elastic electron-cluster scattering on the example of fast electron scatter- 
ing on the fullerene Cqo. Due to the spherical-like shape of the Ceo molecule, 
the charge densities of electrons and ions near the surface of the fullerene 
are much higher than in the outer region. These densities are characterized 
by the radius of the fullerene R and the width of the fullerene shell, a R. 
The de Broglie wave length of a fast electron in collision with fullerene is 
small compared to R. At the same time the characteristic scattering length, 
1 /q, where q is the transferred momentum can be larger than the average 
inter-atomic distance in Cqq, which is approximately as large as a. Under 
these conditions the electron scattering amplitude becomes determined by 
the charge density of the entire fullerene rather than the charge density of 
a single carbon atom. In this limit, the scattering amplitude and the corre- 
sponding cross section possesses a series of diffraction maxima and minima. 
The positions of these minima and maxima are mainly determined by the 
radius of the molecule. In this case the process of electron elastic scattering 
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on Cqo becomes qualitatively similar to the diffraction of an electron at a 
shell. The similar behaviour occurs for other type of clusters as well. 

The cross section of elastic scattering of a fast electron on Ceo in the 
Born approximation (see e.g. [54]) reads as 

Here F{q) is the form-factor of Cqq, 9 = |p — p'j is the momentum transfer 
and p, p' are the momenta of the electron in the initial and the final states 
respectively. 

For the description of the fullerene’s form factor let us express F{q) as 
product of the form factor of the atomic concentration n{q) and the form 
factor of a single carbon atom F^iq)- 

F{q) = FA{q)Y^exp{iqrj) = FA{q)n{q), (3.2) 

i 

where the summation is performed over all coordinates, r^, of carbon atoms 
in the fullerene. The applicability of this approximation has been examined 
in [18] for metal clusters and fullerenes. The form-factor of an isolated car- 
bon atom, FA{q), calculated in the Hartree-Fock approximation decreases 
as a function of q on the scale g ~ 1. The form-factor of atomic concentra- 
tion, n{q), characterizing the geometry of Cqq, varies more rapidly. Indeed, 
for the momenta transfer smaller than an atomic unit, q ^ 1, the model 
of the homogeneous distribution of carbon atoms over the fullerene sphere 
can be used. In this model n{q) = N sin{qR) / qR oscillates with the period 
q = 27r/i? ^ 1. Namely these oscillations form the diffraction pattern of 
the differential cross Section (3.1). The angular dependence of the cross 
Section (3.1) for the projectile electron energy e = 809 eV is shown in 
Figure 1 by solid line. It possesses a series of diffraction maxima and 
minima [18]. Experimental data points derived in [18] in the two sets of 
measurements are shown in Figure 1 by open and solid circles. The cross 
section dependence obtained theoretically is shown in Figure 1 by a solid 
line. Experimental data have been normalized to the theoretical cross sec- 
tion at the second diffraction maximum [6 = 5°). Figure 1 shows quite good 
agreement of the experimental and the theoretical results in position of the 
first and the second maxima. The entire pattern of the differential cross 
section obtained theoretically is very similar to that from the experiment. 

In the vicinity of diffraction maxima at 0 < 10° the cross section greatly 
exceeds the elastic scattering cross section on the equivalent number of 
isolated atoms because of the coherent interaction of the projectile electron 
with the fullerene sphere. In the region 9 > 10°, where 9 > 1, the projectile 
electron scatters on individual carbon atoms of the fullerene rather than on 




A.V. Solov’yov: Electron Scattering 



409 




Fig. 1. Experimental (full and open circles) and theoretical (solid curve) angular 
dependencies of the differential elastic scattering cross section in collision of 809 eV 
electron with the Ceo molecule [18]. Full and open circles correspond to the two 
independent sets of measurements. Dashed line is the differential cross section for 
the mixture containing 60% of Ceo and 40% of equivalent isolated carbon atoms. 
The scale for this curve is given in the right hand side of the figure. 



the entire fullerene sphere. Therefore diffraction features of the cross section 
in the region 0 < 10° are much more pronounced than in the region 0 > 10°. 

Note that in the region 0 < 10°, where g < 1, theoretical cross section has 
zeros while experimental one does not. The presence of zeros at g « nk/ R < 
1 where k is integer, in theoretical curve is the consequence of the coherent 
scattering of electron on the fullerene sphere. However in experiment, zeros 
in the cross section can disappear because of various reasons. For example, 
this can occur due to the presence of carbon atoms or some other impurities 
in the gas cell. Figure 1 shows the differential cross section for the mixture 
containing 60% of Cqo and 40% of isolated carbon atoms by dashed line. 
The differential electron elastic scattering cross section on single carbon 
atoms does not have diffraction oscillations and thus it forms the smooth 
background removing zeroes in the angular dependence of the cross section. 

Finally, let us note that similar considerations are applicable to other 
types of clusters as well. For metal clusters they have been performed in [15] . 

4 Elements of many-body theory 

Metal clusters and fullerenes possess prominent dynamic properties. Due 
to the presence of the highly movable delocalized valence electrons in the 
system, these clusters are higly polarizable. Collective oscillations of the 
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delocalized electrons can be excited in a cluster by a photon or a projectile 
charged particle (electron or ion) . These collective excitations are known as 
plasmons. Polarization effects are especially strong, when considering low- 
energy electron cluster collisions, when the velocity of the projectile electron 
is comparable with the energy of the delocalized cluster electrons. Multipole 
plasmon excitations is easier to study in the fast-electron cluster collisions. 
Note that the dipole plasmon excitation mode can also be effectively probed 
in the photoabsorption or photoionization process. 

The dynamic properties of metal clusters are mainly determined by the 
delocalized cluster electrons and thus can be very well described on the 
basis of the jellium model. Therefore, let us consider here the essential 
elements of the many-body theory developed on the basis of the Hartree- 
Fock jellium model, which are necessary for the understanding of electron 
inelastic scattering processes described in further sections. 

For this purpose let us analyze the transition amplitude Mfi defined as 
follows 



Mf. 



4'/(r')4'„(r) 






(4.1) 



This amplitude describes the transition of the projectile electron from the 
state to the state dr / with simultaneous excitation of the target electron 
from the state 'i’m to 'i’n- This transition is caused by the Coulomb inter- 
action between the electrons. We assume that the final states 'i’n and dr / 
of the electrons can either belong to continuous or discrete spectrum. 

Such an amplitude or similar arises, when considering electron-cluster 
collisions on the basis of the perturbation theory. Depending on the final 
states of the particles this amplitude can describe either inelastic electron- 
cluster collision or electron attachment process. 

The bound states for the extra electron in the field of the positive cluster 
can be calculated within the frozen-core Hartree-Fock approximation and 
by accounting independently for the cluster dynamic polarization potential. 
Bound states in the system of extra electron plus neutral target cluster 
do not exist if consider the system in the HF approximation. They ap- 
pear however, when accounting for the polarization interaction between the 
electron and the cluster. The negative ion wave functions and the ener- 
gies En can be obtained as Dyson’s equation with the non-local polarization 
potential [53]: 






Se„(r,r') T„(r')dr' = £„4'„(r). 



(4.2) 



Here, is the static single-particle Hamiltonian of the cluster, SE(r, rO is 
the energy-dependent non-local potential, which is equal to the irreducible 
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self-energy part of the single-electron Green’s function of the system cluster 
-I- electron. SE(r,rO can be represented diagrammatically as a series on the 
inter-electron correlation interaction (see [26,53]). It is natural to calculate 
SE(r,r') and solve equation (4.2) by using the eigen single-particle wave 
functions ipi of the Hartree-Fock Hamiltonian: 

i7(°V.(r) = e.V’^(r) (4.3) 



where 



H^°Vi(r) 



A OCC \ 

-^-U{r) + J2 J dr' J V'*(r) 

OCC p - 

j V’j(r') |r-r^| V’»(r') dr'V>j(r). (4.4) 



Here, U{v) is the potential of the positive cluster core defined in (2.1). 
The exchange interaction in equation (4.4) is taken into account explicitly, 
which makes the potential in (4.4) non-local contrary to the local density 
approximation in which the exchange correlation interaction is always local. 

Let us note that when the collective electron excitations in a cluster be- 
come important, one should treat the Coulomb many-electron correlations 
properly in order to calculate the matrix element (4.1) or similar correctly. 
For this purpose, let us treat the matrix element (4.1) and excitation ener- 
gies LOfi in the RPAE scheme using the Hartree-Fock wave functions calcu- 
lated within the jellium model as a basis. This method similar to the one 
used in the dipole case [37] for photo-absorption by metal clusters. 

The transition amplitude Mfi = (/|G|z) describes the transition between 
the ground state and the excited state It can be expressed in the 
RPAE as a linear combination of the forward-propagating, and the 

back-propagating, Y^p amplitudes. This relationship reads: 



G 






4- {rn \g\ j) + yI) {j \g\ m 



(4.5) 



Here, the single particle operator g is equal to f (r')4rj(r')dr' and 

the many-body operator G is defined as / j?73Trj'k„(r')4'p(r')dr'. The 
eigenstates \j >, |m > in (4.5) are the solutions of the single particle 
Hartree-Fock equations. Systems of coupled non-linear integro-difierential 
Hartree-Fock equations for the occupied orbitals in the ground state have 
been solved numerically for clusters of different size in [37]. 
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The forward-propagating, and the back-propagating, Y^p ampli- 

tudes are obtained by solving the RPAE equation: 




where matrices A and B are defined as follows 

Arai^nj — (^m ^ij^mn \Y\ ifl) , 



Bmi,nj = {mn \V\ ij ) , 



and V is defined as 

f V’m(r)V’«(r)V^r(rOV’fc(rQ - (r')V>fc(r) 

J |r-r'| 

Note that matrices A and B represent the time-forward and the time- 
reversed diagrams respectively. Using standard methods, one can extract 
the angular dependencies from equation (4.6). 

The numerical solution of the RPAE equation requires a complete single- 
particle basis. Numerically, the RPAE equation is easier to solve when 
using a discrete basis \j >, \m >. In this case the RPAE equation (4.6) 
reduces to the matrix equation of the eigen- value problem. The problem of 
transformation of the complete basis of single-particle states, which includes 
both the discrete and continuous spectra, to an equivalent complete basis 
of discrete states can be solved by the B-spline method [55] . 

5 Inelastic scattering of fast electrons on metal clusters 

Let us now consider the inelastic scattering of fast electrons on metal clusters 
and fullerenes, using approaches and methods described in the previous 
section. This process is of interest because the many-electron collective 
excitations of various multipolarity provide significant contribution to the 
cross section as demonstrated in [15-18]. 

Plasmon excitations in metal clusters and fullerenes have been inten- 
sively studied during last years (see e.g. [9-12,23,56-58]). They were ob- 
served in photo-absorption experiments with metal clusters [3-5] and in 
photo-ionization studies with the fullerenes [6,7]. These studies were con- 
tinued by a large number of experimental and theoretical works (see reviews 
cited above) . In photoionization experiments with metal clusters only dipole 




A.V. Solov’yov: Electron Scattering 



413 



collective excitations have been investigated. The same concerns the en- 
ergy loss experiments mainly performed with the fullerene films for various 
charged particles, see e.g. [59]. These experiments studied the region of rel- 
atively small electron scattering angles and, therefore, only dipole collective 
excitations in the target have been probed. 

Electron collective modes with higher angular momenta can be excited 
in metal clusters and fullerenes by electron impact if the scattering angle 
of the electron is large enough [15-18]. The plasmon excitations mani- 
fest themselves as resonances in the electron energy loss spectra. Dipole 
plasmon resonances of the same physical nature as in the case of the photo- 
absorption, dominate the electron energy loss spectrum if the scattering 
angle of the electron, and thus its transferred momentum, is sufficiently 
small. With increasing scattering angle plasmon excitations with higher 
angular momenta become more probable. The actual number of multipoles 
coming into play depends on the cluster size. 

Collisions of electrons with metal clusters and fullerenes have been stud- 
ied both experimentally and theoretically during the last years. Collisions of 
fast electrons with fullerenes were experimentally studied in [60]. Theoret- 
ically this problem was tackled for metal clusters and fullerenes in [15] and 
independently for fullerenes in [61]. Ab initio many-body quantum treat- 
ment of inelastic scattering of fast electrons on metal clusters have been 
performed in [16,17]. In these papers both differential and total cross sec- 
tions have been calculated. The total inelastic cross sections in electron 
collisions with metal clusters have been measured in [27,28]. In electron- 
fullerene collisions, the measurements of the total inelastic scattering cross 
sections have been performed in [62-65]. 

Now let us consider the cross section of the scattering process: 

d^CT = ^ |M/,(p,p')|^ (5 -f £/ - Y - 

where p and p' are the initial and the final momenta of the projectile elec- 
tron, £i and £/ are the energies of the initial and the final states of the 
cluster, the integration over df implies the summation over the discrete 
spectrum and the integration over the continuous spectrum of the final 
states of the cluster. 

Let us describe the initial and the final states of the electron by plane 
waves. This assumption is valid when the collision velocity is much larger 
than the characteristic velocity of the delocalized electrons in a cluster (for 
estimates see [15]). 
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The amplitude M fi of the process in the plane wave Born approximation 
reads as 




where are the coordinates of the delocalized electrons in the cluster, 
q = p — p'. The wave functions dTi, are the initial and the final states of 
the cluster. Note that namely the many-electron wave function carries 
all the information about the collective plasmon excitation in a cluster. 

Now let us perform the multipole expansion of the exponents in (5.2) 
(see e.g. [66]) 











^ji{qra)Yi„,{na) 




(5.3) 



If we consider spherical clusters with the zero total angular momentum 
of the ground state dTj, the contributions of different multipolarity do not 
interfere in the cross section (5.1). In this case, substituting (5.3) to (5.1) 
and performing simple transformations, we come to the following expression 
for the triply differential inelastic scattering cross section 



d^cr 

de'dfl 



167rp' 



^(2; + l)^ 



Im 






'^jl{qra)Ylm{ria) 




2 




Here, dH is the solid angle of the scattered electron. The summation over 
/ implies the summation over the discrete spectrum and the integration 
over the continuous spectrum of the final states of the cluster. We calculate 
the many electron wave functions dTj, dry and the excitation energies £/ 
and £i, using the Hartree-Fock jelium model. As soon as collective electron 
excitations in a cluster play the significant role, then in order to obtain 
the correct result when calculating the matrix elements in (5.4) one should 
properly take into account many-electron correlations. This problem can 
be solved in the RPAE described in the previous section. 

Integrating the triply differential cross Section (5.4) over dH, we derive 
the total differential energy loss spectrum 



dcr 

de' 




dq 



^(2Z + 1) 




^il{qra)Ylm{ria) 




2 
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(5.5) 



Integrating this equation over the transferred energies of the electron, we 
come to the expression for the total cross section of inelastic scattering 



dq 



_ 327t2 ^ /•«”“ 

^ f qmin y 









^jl{qra)Ylm{na) 






(5.6) 



The minimum and the maximum transferred momenta in (5.5) and (5.6) are 
equal to gmin = p(l - a/T^^wT^) and g^ax = p(l + a/1 - ujfije). Here, 
£ f and Si are the energies of the levels i and / respectively, e is the energy 
of the projectile electron. The contributions of different multipolarity do 
not interfere in (5. 4-5. 6), which is the result of spherical symmetry of the 
target cluster. 



6 Plasmon resonance approximation: Diffraction phenomena, 
comparison with experiment and RPAE 

Besides complex numerical calculations of the cross section (5. 4-5. 6) one 
can derive rather simple approximate analytical results giving the distinct 
physical picture of the process [15]. Indeed, let us consider the behaviour of 
the inelastic cross sections in the vicinity of the giant collective resonance 
when surface plasmon excitations give the main contribution. In this case 
the interaction of the projectile with electrons in the surface layer of the 
width a near the surface of the cluster should play the most significant role 
in the inelastic scattering process. The width a of this layer is determined 
by the width of the region near of the surface of the cluster where oscil- 
lations of electron density mainly occur. This width is of the same order 
of magnitude as the size of a single atom. Oscillations of electron density 
take place mainly near the surface, because the electron density inside the 
cluster is well compensated by the oppositely charged density of the ionic 
background. Mathematically this means that theory has a small param- 
eter, namely a/R <C 1, which allows one make the simplification of final 
results. Indeed, in the case of the collective excitation the main contribu- 
tion to the matrix elements in (5. 3-5. 6) arises from the Va ~ R- This feature 
is known for the dipole matrix elements from the photoabsorption studies 
(see e.g. [58]). 
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The condition Va ~ R allows us to simplify the matrix elements in (5.3) 
and express them via the multipole matrix elements as 






where 



^il{qra)Ylm{ria) 

a 

<3‘/f = / 






2l+lji{qR) 

/^Im fa ^\ 



y ] ^aYlm(na) 






21 -t~ 1 

Substituting these equations to (5.3) and using the relationship 



J |Q/?r -£i- £/)d/ = i/ma;(w), 



( 6 . 2 ) 



where ai(oj) is the multipole dynamic polarizability of the cluster we finally 
obtain the expression of the cross section via the imaginary parts of the 
multipole dynamic polarizabilities 



d^cr 

de'dn 



4p' 

Trpq"^ 



^(2(+l)»^/™a,(Ae). 



(6.3) 



Here, As = p^/2 — p'^/2. 

Integrating the cross section (6.3) over dH, we derive the expression for 
the total spectrum of the electron energy loss 

d(J 8 ^ — A (21 -j- 1)^ \ \ T /A\ /r- A\ 

J 121-2 Si(Ae)Imai(Ae). (6.4) 



Here, Ae = s — s'. We have also introduced functions Si{As) as follows 

PQm&xR 

Si(Ae) = / -jKx). (6.5) 

J qminR ^ 

The li mits q^in a nd gr^ax are equal to q^in = p(l - ^/T^^Ae/s), gmax = 
p(\ + ~ Ae/e). In the region the most interesting for our consideration, 

one derives As ~ Up and e ^ UpR^ . Therefore, Ae/e <C I and the limits of 
integration in (6.5) are approximately equal to gmin ~ Ae/v and gmax ~ 2p. 
In this case qmaxR !> the upper limit in (6.5) can be replaced by the 
infinity and Si becomes a function of AeR/v <C 1. 

Equations (6. 4-6. 5) establish the connection between the inelastic scat- 
tering cross section and the imaginary parts of the multipole dynamic po- 
larizabilities. These polarizabilities have a resonance behavior in the region 
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of frequencies where collective electron modes in a cluster can be excited. 
In the plasmon resonance approximation (see e.g. [15,58]) the polarizability 
ai{u>) can be written as 



ai{uj) = R 



- ??2/+i 



jJ — uP- — iuT I ’ 



( 6 . 6 ) 



where oji is the resonance frequency of the plasmon excitation with the 
angular momentum I (we have omitted index p when defined u>i), which 
for metal clusters according to the Mie theory (see e.g. [58]) is equal to 

oji = ^ ( 21 ^)^ ~ \J ( 2 i+i)fl ^ fullerenes [15,61]. Here, is the 

number of delocalized electrons in the cluster. The parameter T/ in (6.6) is 
the width of the plasmon resonance with the angular momentum 1. 

These formulae demonstrate that due to the resonance behavior of the 
polarizability, the differential inelastic scattering cross section should also 
exhibit resonances, if the transferred energy lies in the range characteristic 
for plasmon excitations. 

Integrating (6.4) over e' , we derive the plasmon contribution to the total 
inelastic cross section. Assuming that the plasmon resonance is narrow 
enough and using the pole approximation (6.6), we obtain the following 
expression for the plasmon contribution to the inelastic cross section 



a = ^ y~!(2^ + \YuJiSi{iOi). (6.7) 

V ^ ' 

i 



Note that excitations with large enough angular momenta I have a single 
particle nature rather than collective character. It follows, for instance, 
from the fact that with increasing I the wave length of the surface plasmon 
mode, 2ttR/1, becomes smaller than the characteristic wave length of the 
delocalized electrons on the surface of the cluster, 27 t/-\/^, where £ is the 
characteristic kinetic electron energy in the cluster. Another words, excita- 
tions with the angular momenta comparable or larger than the characteristic 
electron angular momenta of the ground state should have a single particle 
character rather than collective nature. Therefore analyzing contribution of 
the collective modes, which have prominent resonance character, we should 
restrict the consideration only by relatively low angular momenta. For ex- 
ample, according to the jellium model the maximum angular momentum 
of the delocalized electrons in the Na 4 o cluster is equal to 4. Therefore, 
only the dipole, the quadrupole and the octupole collective modes can be 
expected in this case. This means that only the first three terms in (6.6) 
should be considered. With increasing cluster size the number of collective 
modes grows as R. 

The plasmon resonance approximation and its classical (see [67,68]) and 
quasi-classical (see [69,70]) modifications can be used for the estimation of 
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the cross sections and investigation their dependence on various parameters 
even for the large clusters having hundreds and thousands of atoms, when 
ab initio calculations are hardly possible. For the small cluster systems its 
validity can be verified by performing the ab initio quantum mechanical 
calculation of cross sections. Such calculation for the electron collision with 
the Na 2 o, Na 4 o, Nasg and Nag 2 clusters have been performed in [16,17]. 
In both approaches we have described the collision process in the Born 
approximation, which is applicable when the energy of the projectile electron 
surpasses typical energy of the delocalized electrons. Comparison of the two 
approaches demonstrates that collective excitations of electrons in a cluster 
provide dominating contribution to inelastic scattering cross sections and 
explain their most important features. 

The resonance plasmon approximation provides a simple criterion for 
the estimation of the relative importance of various plasmon modes. In- 
deed, according to the resonance plasmon approximation the minima and 
the maxima of the contribution of the plasmon mode with the angular mo- 
mentum I are determined by the minima and the maxima of the diffraction 
factor jf{qR) as it follows from (5.6). This factor shows that diffraction 
phenomena arise also in electron inelastic scattering on clusters. The main 
maximum of the partial contribution with the angular momentum I arises 
when qR ~ 1. This condition reflects a simple fact that the probability 
of the excitation of the collective plasmon mode is maximum, when the 
characteristic collision distance is about the wave length of plasmon. This 
results in the significant dependence of the profile of the spectrum on the 
angle of the scattered electron. 

Let us consider now the behaviour of the cross section for the electron- 
fullerene Cqo collision. In Figure 2 (left pannel) we compare the calculated 
differential cross section, dcr/de'dn, (solid line) with the experimental elec- 
tron energy-loss spectrum [60] shown by dots. The electron impact energy is 
equal to 1 keV and the electron scattering angle is equal to 1.5° in Figure 2a 
(left pannel) and to 10° in Figure 2b (left pannel). By the dashed line we 
show the dominating dipole plasmon contribution (Fig. 2a, left pannel) and 
the quadrupole plasmon contribution (Fig. 2b, left pannel). The monopole 
term does not contribute much to the cross section, since there practically 
exists no monopole surface plasmon excitation mode in the system. In 
Figure 2c (left pannel) the dominating octupole contribution is shown by 
the dashed line while the guadrupole and the dipole contributions are shown 
by dotted and dashed-dotted lines respectively. Unfortunately, there exist 
no experimental data related to this scattering angle. 

Figures 2a, b (left pannel) show quite reasonable agreement of theoretical 
results with the experimental data. At low momentum transfer (9 = 1.5°) 
the dipole excitation dominates in the energy loss spectrum (Fig. 2a, left 
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Fig. 2. Left pannel: the differential cross section dcr/de'dfi (5.4) as a fnnction 
of the transferred energy Ae calcnlated for the electron-fnllerene Ceo collision [17]. 
The impact electron energy is e = 1 keV. The scattering angle is 9 = 1.5° (a), 9 = 
10° (b) and 9 = 5° (c). The resnlts of the resonance plasmon approximation are 
shown by solid lines. Dots in figures (a) and (b) represent the experimental data 
from [60]. Dashed lines show the leading mnltipole (dipole (a), quadrupole (b) and 
octnpole (c)) plasmon contribntion to the spectrnm. In Figure Ic, contributions 
of the qudrupole and the dipole plasmon modes are shown by dotted and dashed- 
dotted lines respectively. Right pannel: the differential cross section, d(r/de'dD, 
(5.4) as a function of the transferred energy Ae calculated for the electron-Na4o 
collision [17]. The impact electron energy, e = 50 eV. The electron scattering 
angle is 6 = 1° (a), 0 = 6° (b) and 9 = 8° (c). Solid lines represent the resnlts of 
the RPAE calculation with the Hartree-Fock jellium model basis wave functions. 
Thick solid line is the total energy loss spectrum. Thin solid lines marked with 
the angular momentum number represent various multipole contributions to the 
energy loss spectrum. By dashed line we plot the electron energy loss spectrum 
calculated in the resonance plasmon approximation. 
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pannel), while for 6 = 10° the quadrupole contribution (Fig. 2b, left pannel) 
provides the main contribution. As a result the position of the maximum 
of the energy loss spectrum shifts from the dipole plasmon frequency oji at 
9 = 1.5° to the quadrupole plasmon frequency U 2 at 0 = 10°. At 0 = 5°, 
the position of the maximum is close to the octupole plasmon resonance 
frequency W3 (Fig. 2c, left pannel). 

To check the validity of the pure resonance treatment of plasmon exci- 
tations in a cluster and establish the relative role of the collective modes 
in the formation of the inelastic scattering cross sections, let us compare 
the results obtained from the direct quantum calculations with those de- 
rived from the resonance treatment of plasmon excitations in a cluster. In 
Figures 2a-c (right pannel) the differential energy loss spectra, Aedcr/de', 
(5.1) as a function of the transferred energy Ae are shown for the Na4o tar- 
get cluster at the electron scattering angles 0=1° (a, right pannel), 0 = 6° 
(b, right pannel) and 0 = 8° (c, right pannel) respectively [17]. 

Figure 2a (right pannel) shows that at 1° the dipole plasmon excitation 
dominates in the electron energy loss spectrum. The quadrupole excitation 
provides relatively small contribution to the spectrum. The contributions 
of the monopole and all higher multipole excitations are almost negligible 
at this scattering angle. Figure 2b (right pannel) demonstrates that at 
6° the quadrupole excitation becomes the leading excitation in the electron 
energy loss spectrum, shifting the maximum of the spectrum towards higher 
energies and also changing the profile of the resonance. The dipole and 
the octupole excitations provide also considerable contributions in a wide 
range of transferred energies broadening the spectrum. The dominance of 
quadrupole excitation is not as large as for the dipole excitation at 1°. 
At 9° the picture (Fig. 2c, right pannel) becomes more complex. In this 
case the octupole excitations provide the dominating contribution to the 
spectrum in the vicinity of the maximum of the energy loss spectrum at 
Ae « 4 eV. Besides this region the dipole, quadrupole and even excitations 
with angular momentum 4 give comparable contributions to the energy loss 
spectrum and form rather broad structure. The monopole excitation and 
the excitations with angular momentum 5 and higher are almost negligible. 
With increasing scattering angle, excitations with the angular momentum 
4 become more important. However, the corresponding spectrum does not 
possess a resonance behaviour, because it is mainly formed by single electron 
transitions. 

Comparison of the results derived from the RPAE calculations with those 
obtained in the resonance plasmon approximation shows that, in spite of 
the simplicity, the plasmon resonance treatment is in quite good agreement 
with the consistent many-body quantum calculation. The main discrep- 
ancy between the two approaches arises from the single particle transitions 
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omitted in the resonance plasmon approximation, but taken into account 
in the RPAE calculation. These transitions bring some structure to the 
final energy loss spectra manifesting themselves over the smooth resonance 
behaviour which is reproduced by the resonance plasmon approximation. 
At larger scattering angles plasmons with larger angular momenta can be 
excited. However, as we know, excitations with large enough angular mo- 
menta occur due to single particle transitions rather than due to collective 
excitations. Therefore, the agreement between the resonance plasmon ap- 
proximation and the RPAE is better at small angles. 



7 Surface and volume plasmon excitations in the formation 
of the electron energy loss spectrum 



Now let us discuss the formation of the widths of plasmon resonances. 
Damping of the plasmon oscillations is connected with the decay of the 
collective electron excitations to the single-particle ones similar to the mech- 
anism of Landau damping in infinite electron gas. Frequencies of the surface 
plasmon excitations in neutral metal clusters usually lie below the ionization 
threshold. Therefore single-particle excitations in the vicinity of the surface 
plasmon resonance have the discrete spectrum. In this case the width of a 
surface plasmon excitation caused by the Landau damping should be treated 
as the width of the distribution of the oscillator strengths in the vicinity of 
the resonance. The problem of the formation of the surface plasmon reso- 
nance widths in clusters has been studied during past years in a number of 
works [19,69,71,72]. 

Similar situation takes place for volume plasmon excitations in metal 
clusters [19]. In metal clusters the resonance frequencies of volume plas- 
mons are above the ionization threshold. This means that the volume plas- 
mon excitations are quasi-stable. They have the real channel of the Landau 
damping leading to the ionization of the cluster. Thus the process of in- 
elastic scattering in the region of transferred energies above the ionization 
threshold can be described as follows. The projectile particle induces the 
oscillations of the electron density in the cluster. Oscillations of the electric 
field caused by the electron motion result in the ionization of the cluster. 
Note that the similar scenario takes place with damping of the surface plas- 
mon resonances in fullerenes [6], which also decay via the autoionization 
channel. 

The differential cross section of the electron inelastic scattering on metal 
clusters obtained in the plasmon resonance approximation with accounting 
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for both surface and volume excitations [19] reads as 



dV 
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Here Wp = y^3Ne/a is the volume plasmon resonance frequency, uii = 
yjl/{2l + l)o;p is the frequency of surface plasmon excitation with the an- 
gular momentum I, Ng is the number of delocalized electrons, a is the static 
polarizability of the cluster, F„; and T^i are the widths of the volume and 
surface plasmon resonances, which are defined below. Note that volume 
plasmon excitations with different angular momenta have the equal reso- 
nance frequency Wp. 

This cross section is totally determined by collective electron excitations 
in the cluster. The first and the second terms in (7.1) describe contributions 
of the surface and the volume plasmon excitations respectively. Note, that 
similar expression for the cross section have been also obtained in [70] for 
electron scattering on small metal particles by means of classical electrody- 
namics. 

According to [19], the width of the surface plasmon resonance in the 
plasmon resonance approximation is equal to: 

^ S{un -Ef, + (7.2) 



where (^sj(r) = ((r/i?)^0(i? — r) -I- (i?/r)(*+^^0(r — i?)) Y/m(n). Note that 
the same expression was obtained earlier in [69, 71] using other methods. 
Evaluation of the expression (7.2) for sufficiently large clusters leads to the 
well known result for the Landau damping of the surface plasmon oscillations 
= 31vf/R [71], where up is the velocity of the cluster electron on the 
Fermi surface. 

In the plasmon resonance approximation one can also determine the au- 
toionization width of the volume plasmon resonance [19], which is equal to: 

_ SlT^ajp I /-O', 

q^R^ {jf{qR) - ji+i{qR)ji_i{qR) - -^ji+i{qR)ji{qR)y 

where ipvi{v) = (ji{qr) - ji{qR){r / R^) 0{R - r)Yim{ n). Here the summa- 
tion is performed over the occupied single- electron states jz and the integra- 
tion is performed over the electronic states p of the continuous spectrum. 
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The projectile particle excites simultaneously numerous modes of the 
volume plasmon. The sum of the potentials of all the modes gives the re- 
sulting potential ipvi{r). It is essential that all normal modes of the volume 
plasmon have the same resonance frequency Wp, but the excitation proba- 
bility for these modes depends on the kinematics of collision. This leads to 
the dependence of the volume plasmon potential upon the transferred 

momentum. The oscillations of the volume plasmon potential result in the 
ionization of the cluster, which probability and the volume plasmon reso- 
nance width depend on transferred momentum q. However, the numerical 
analysis performed in [19] shows that the dependence of T„/ on q is rather 
weak in the region of g ^ 1, where collective electron oscillations mainly 
take place. Therefore the volume plasmon resonance width with the given 
I can be approximated by the limiting value following from (7.3) at g = 0: 

r„/ = J S{uJp-e^ + e^)d^^, (7.4) 

where iph{r) = {r/RY (l - (r/RY) 9{R - r)Yim{n). 




q, a.u. 

Fig. 3. Autoionization width of the dipole (1), the quadrupole (2) and the 
octupole (3) volume plasmon excitations as a function of transferred momen- 
tum q [19]. 

Figure 3 shows the dependence of the autoionization width on the trans- 
ferred momentum g for the volume plasmon modes, which provide signif- 
icant contribution to the EELS. The width of the dipole, the quadrupole 
and the octupole volume plasmon resonances has been calculated according 
to (7.3). The transferred momentum g plays the role of the wave vector for 
the volume plasmon excitations. All three plasmon modes have the similar 
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Fig. 4. Differential cross section, d(r/de'dD as a function of the transferred en- 
ergy Ae calculated for the collision of 50 eV electron with the Na 4 o clusters [19]. 
The electron scattering angle is 0 = 9°. Solid lines represent the RPAE results. 
Thick solid line is the total energy loss spectrum. Thin solid lines marked by 
the corresponding angular momentum number represent various partial contribu- 
tions to the energy loss spectrum. Contributions of the surface and the volume 
plasmons calculated in the plasmon resonance approximation (7.1) are shown by 
dashed and dotted lines respectively. Dashed-dotted line represents the sum of 
the surface and volume plasmon contributions to the EELS. 



dependence of upon q. The width grows slowly in the region of small q 
and it decreases rapidly at larger q. In the latter region the probability of 
volume plasmon excitation by the incoming electron is correspondingly re- 
duced. Note that the wave length of a collective electron oscillation should 
be larger than the inter-electronic distance in the cluster, i.e. plasmon wave 
vector should be smaller than the Fermi momentum of cluster electrons 
q < 0.5. In the region q < 0.5, where the latter condition is fulfilled, the de- 
pendence of upon q is rather weak. We can approximate the resonance 
width by following values F„i ~ O.bwp, F „2 — O.Swp, F „3 ~ 0.23wp. Con- 
trary to surface plasmons, the autoionization width of a volume plasmon 
decreases with the growth of the angular momentum. 

Let us consider electron energy loss spectra in collision of an electron 
with the Na 4 o cluster in the region above the ionization potential, where 
volume plasmon modes become significant. 

Figure 4 shows the EELS in the region above the ionization threshold 
Ae > 3.3 eV calculated using the RPAE. The energy of the collision is 
equal to e = 50 eV and the scattering angle 9 = 9°. Thick line corresponds 
to the total energy loss spectrum, while thin lines show various partial 





A.V. Solov’yov: Electron Scattering 



425 



contributions. The angular momenta corresponding to these lines are 
marked by numbers from 0 to 4. The partial contribution to the EELS 
with I < 3 have the broad maximum in the vicinity of Ae ~ 5.1 eV. Com- 
parison of the EELS calculated in the two different approaches proves our 
assumption that the peculiarity in the EELS in the vicinity of Ae ~ 5 eV is 
connected with the volume plasmon excitation. Figure 4 demonstrates that 
collective excitations provide dominating contribution to the total EELS 
determining its pattern. 



8 Polarization effects in low-energy electron cluster collision and the 
photon emission process 

Our consideration in the previous sections has been mainly focused on the 
collisions of fast electrons with metal clusters and fullerenes, because the 
results of the fast electron-cluster collisions theory have the most straight- 
forward and simple connection to the experiment. 

However, the review of the electron-cluster collisions theory would not 
be complete if one says nothing about the low energy electron- cluster col- 
lisions. In the low energy electron cluster collision, the electron collision 
velocity is lower or comparable with the characteristic velocities of the clus- 
ter delocalized electrons. This criterion can be traced from the Born theory 
of electron-cluster collisions, which have very much in common with the 
Born theory of electron-atom collisions [54]. In [15], it was shown that 
the electron collisions with metal clusters in the region of collision energies 
below 3-5 eV should be treated as slow, while for fullerenes, this region 
extends up to 30 eV. 

In the low energy electron-cluster collisions the role of the cluster 
polarization and exchange correlation effects increases dramatically. The 
polarization potential of electron-cluster interaction sometimes changes 
completely the qualitative picture of the collision. This for example takes 
place, when considering low energy electron elastic scattering on metal clus- 
ters. In this case the resonant structures can appear in the energy depen- 
dence of the electron elastic scattering cross section due to the presence 
of the bound or quasi-bound states in the system [73,74]. The resonance 
structure turns out to be very sensitive to the choice of the approximations 
made for its description and has not been experimentally observed so far. 

In [73, 74] consistent many-body theory based on the jellium model and 
the Dyson equation method has been used for the description of low en- 
ergy elastic electron-cluster collisions. Calculation of the low energy elastic 
scattering cross section was also performed in the semiclassical approxima- 
tion [75] and entirely quantum mechanically [76]. The influence of the choice 
of various LDA potentials on the elastic scattering of low-energy electrons 
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(below 10 eV) was considered in [77]. Low energy electron-fullerene scat- 
tering was studied experimentally in [78,79]. Collisions of low and interme- 
diate energy electrons with metal clusters were experimentally investigated 
in [27-30] . The response function of fullerene and its relation to the inelastic 
scattering problem was considered in [80] . 

During the last years, considerable attention has been devoted both 
experimentally and theoretically to the problem of electron attachment to 
metal clusters and fullerenes. The electron attachment process is one of 
the mechanisms, which lead to the negative cluster ion formation in gases 
and plasmas and thus it attracts the interest of numerous researches. For 
fullerenes, the experimental observations of electron attachment have been 
performed in [78,79]. For metal clusters, the electron attachment problem 
has been the subject of the intensive experimental [27-30] and theoretical 
investigations [20,21,24,26] and is not yet completely understood. Let us 
further discuss this problem in more details. 

The very simple picture of attachment is described in many textbooks, 
see e.g. [81]. Let us assume that there exists a Langevin attraction potential 
of the form 

= ( 8 . 1 ) 



outside the cluster radius. The constant a is the static polarizability of the 
cluster. One can then show that there is an orbiting cross section 



a = TT 






( 8 . 2 ) 



which sets an upper limit bound to the attachment cross section (so called 
the Langevin limit). Here e is the kinetic energy of the projectile electron. 
This simple treatment, if valid, would explain the behaviour of the cross 
section in the vicinity of the threshold. 

It is known that metal clusters possess a high polarizabilty, see e.g. [9]. 
Hence, large capture cross sections are anticipated. However, simple at- 
tempts to account for attachment by using the static polarizabilty a are 
not in accordance with observation [27]. A recent review of electric polar- 
izability effects in metal clusters is given in [29]. The great weakness of 
the Langevin model is the treatment of a as an approximate constant. In 
fact it possesses a complicated energy dependence, due to the dynamical 
polarizabilty of the metallic cluster. 

The possibility of resonances in the capture cross section was consid- 
ered, first, theoretically in [20,21]. It was theoretically demonstrated that 
electrons of low energy can excite a collective plasmon resonance within the 
metal cluster in the electron attachment process as a result of a strong dipole 
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deformation of the charge density of the cluster. Later this idea commented 
in the context of the measurements performed in [28], although no clear 
evidence of resonant behaviour have been found. In [28] there have been 
measured total inelastic scattering cross sections, which include attachment 
as only one of several possible contributing channels. 

The resonant electron attachment mechanism was named in [20,21] as 
polarizational. An important consequence of the polarization mechanism 
is that the low energy electron falls into the target and the probability of 
this process is enhanced. Since the process as a whole is resonant, the 
enhancement is greatest for energies rather close to the plasmon resonance 
in the dynamic polarizability of the cluster. This fact explains another 
reason for the interest to this process. Indeed, the predictions of resonantly 
enhanced electron capture by metal clusters have been based on the jellium 
model. While there is convincing evidence for the jellium picture through 
the occurrence of magic numbers, ellipsoidal structures in studies of the 
stability of metallic clusters and through the energy splittings of plasmon 
resonances, the range of validity of the jellium model remains uncertain and 
is subject to current discussions. 

In the attachment process the electron losses its excess energy. Polar- 
izational bremsstrahlung is one of possible channels of the electron energy 
loss [20-23,25]. Energy of the electron can also be transferred to the excita- 
tions of the ionic background of the cluster [32], which may lead to increase 
of its vibrations and final fragmentation. In spite of the significant phys- 
ical difference between various channels of the electron energy loss, they 
have one important common feature: they all go via the plasmon exci- 
tation. Therefore, calculating the total electron attachment cross section 
including all possible channels of the electron energy loss in the system, one 
obtains [24, 26] qualitatively similar dependence of the cross section as it 
was obtained initially for the radiative channel of electron energy loss [20] . 

In [20] the attachment cross section has been calculated within the jel- 
lium model in a scheme which holds best if the kinetic energy of the elec- 
trons is somewhat higher than the energy of the resonance. Also, it was 
assumed that the attached ion is created in the ground state. As a useful 
step in simplifying the calculation, a Kramers-Kronig transformation pro- 
cedure was introduced to compute the polarizability from the absorption 
coefficient, thereby circumventing the need for full ab initio calculations. 

Within this approximate scheme, it was found [20,21] that the resonant 
attachment cross section dominates over the non-resonant by a factor of 
about 10^-10"^ near resonance, and is therefore a very significant pathway 
for electrons of low enough energy. 

In [24, 26] , the earlier theoretical work on attachment was extended by 
including the following improvements: (a) there were included all possible 
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Fig. 5. A comparison between theory and experiment in the vicinity of the plas- 
mon resonance. The theoretical curves from [26] and includes the photoabsorption 
spectrum. The experimental points are from [30]. 

channels of the electron attachment and calculated the total cross section of 
the process rather that analyzed a particular single channel, no assumption 
that the system can only return to its ground state had been made; (b) the- 
oretical approximation allowed to treat electron energies not only in the 
resonance region, but actually throughout the range of interest; (c) there 
were performed an RPAE calculation of the dynamical polarizability and 
the corresponding electron attachment cross sections on the basis of the con- 
sistent many body theory with the use of the Hartree-Fock jellium model 
wave function; (d) calculations were performed for both neutral and charged 
cluster targets; (e) the polarization effect on the incoming particle has been 
taken into account and collective excitations of different multipolarity in 
the target electron system are taken into account; (f) Dyson’s equation was 
used to reduce the problem of the interaction of an extra electron with a 
many-electron target system to a quasi-one-particle problem in a similar 
way as it was done for negative atomic ions calculations [53] . 

An example of such calculation is shown on the upper plot of Figure 5. 
This plot represents the total and partial electron capture cross sections 
calculated for neutral potassium Ks cluster. The insert demonstrates the 
photoabsorption spectrum of ATg. In [26] this calculation was performed in 
various approximations outlined in the previous paragraph. It was found 
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that the resonance pattern in the electron capture cross section for the Kg 
cluster turns out to be similar in various approaches, although for some other 
sodium and potassium clusters it is more sensitive to the approximations 
made [26]. The plasmon resonance in the electron capture cross section is 
shifted on the value of energy of the attached electron as compared to the 
photoabsorption case shown in the insert. 

Experimental evidence for the resonance enhancement of the cross sec- 
tions of electron attachment process has been recently obtained [30]. The 
experimental points from [30] are shown on the lower panel of Figure 5. 
Comparison of the upper and lower parts of Figure 5 shows that the rea- 
sonable agreement between the predictions of theory and the experimental 
results has been achieved, although more precise measurements would be 
desirable to resolve the more detailed structures in the electron attachment 
cross sections. 

Note that in its present state, the theoretical description of metallic 
clusters does not allow such complex processes as resonantly enhanced at- 
tachment (in which electron correlations play a dominant role) to be com- 
puted with the inclusion of the ion core structure. Fortunately, it has been 
shown that the jellium approximation provides a good theoretical frame- 
work, capable of predicting many properties of metallic clusters. Within 
this framework, all the refinements of the RPAE become acceptable for the 
calculation of collective phenomena dominated by electron correlations. 

It is also worth to mention that the plasmon resonance enhanced mech- 
anism of electron attachment considered above is typical for metal clusters 
rather than for fullerenes. Although fullerenes have many similarities in the 
properties with metal clusters and also possess the plasmon resonances, the 
energies of these resonances are much higher (~7 eV and ~19 eV) and thus 
cannot be reached at low kinetic energies of the projectile electron. 

Finally, let us note that strong polarization effects arise also in the pro- 
cess of photon emission by an electron colliding with a cluster [21-23,25]. 
Such a process is known as the polarization bremsstrahlung or polarization 
radiation (see e.g. [23]). In these papers it was demonstrated that the plas- 
mon resonance structure manifests itself in the photon emission spectrum 
in collisions of electrons with metal clusters and fullerenes. This effect oc- 
curs because the cross section of this process is mainly determined by the 
dynamical polarizability of the cluster and is applicable to any polarizable 
system possessing a collective giant resonance. 



9 How electron excitations in a cluster relax 

In metal clusters, the plasmon resonances lie below the ionization thresh- 
olds, i.e. in the region of the discrete spectrum of electron excitations [11]. 
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This fact rises an interesting physical problem about the eigen widths of 
those electron excitations, which possess large oscillator strengths and form 
the plasmon resonances. Knowledge of these widths is necessary for the 
complete description of the electron energy loss spectra, electron attach- 
ment, polarizational bremsstrahlung and photoabsorption cross sections in 
the vicinity of the plasmon resonances and the description of their depen- 
dence on the cluster temperature. Note, that experimentally the depen- 
dence of the plasmon resonance photoabsorption patterns of metal clusters 
on temperature has been studied in [82]. 

In metal clusters, the origination of the electron excitation widths is 
mainly connected with the dynamics of the ionic cluster core [31,32,83-87]. 
This is an example of effect, which has no analogy in atomic physics. Con- 
trary, in fullerenes, the discrete transitions with the energies above the ion- 
ization threshold, which form mainly the plasmon resonance in the vicinity 
of 20 eV, possess the autoionization widths [6]. The latter mechanism of 
the line width broadening is well known in atomic physics [52] and we do 
not discuss it here. 

Instead, let us focus on the influence of the dynamics of ions on the 
motion of delocalized electrons in metal clusters and discuss it on the basis of 
the dynamic jellium model suggested in [31] and further developed in [32,33] . 
This model generalizes the static jellium model [36-39], which treats the 
ionic background as frozen, by taking into account vibrations of the ionic 
background near the equilibrium point. The dynamic jellium model treats 
simultaneously the vibration modes of the ionic jellium background, the 
quantized electron motion and the interaction between the electronic and 
the ionic subsystems. In [31] the dynamical jellium model was applied for a 
consistent description of the physical phenomena arising from the oscillatory 
dynamics of ions. 

The dynamic jellium model [31] allows to calculate widths of the elec- 
tron excitations in metal clusters caused by the dynamics of ions and their 
temperature dependence accounting for two mechanisms of the electron ex- 
citation line broadening, namely adiabatic and non-dynamic ones. 

The adiabatic mechanism is connected with the averaging of the elec- 
tron excitation spectrum over the temperature fluctuation of the ionic back- 
ground in a cluster. This phenomenon has been also studied earlier in a 
number of papers [31,32,83-87]. The adiabatic linewidth is equal to 



r = 



/41n2 



cth 



n 

2 ^ 






(9.1) 



Here m and are the mass and frequency corresponding to the generalized 
oscillatory mode considered, T is the cluster temperature, k is the Bolzmann 
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constant, 




is the matrix element of the electron phonon coupling, cal- 



culated for surface and volume cluster vibration modes in [32]. 

The mechanism of dynamic or non-adiabatic electron excitation line 
broadening has been considered for the first time in [31,32]. This mech- 
anism originates from the real multiphonon transitions between the excited 
electron energy levels. Therefore the dynamic linewidths characterize the 
real lifetimes of the electronic excitations in a cluster. The analytic expres- 
sion for the non-adiabatic width obtained in [31,32] is rather cumbersome 
as compared to (9.1) and thus we do not present it here. 

The adiabatic broadening mechanism explains the temperature depen- 
dence of the photoabsorption spectra in the vicinity of the plasmon res- 
onance via the coupling of the dipole excitations in a cluster with the 
quadrupole deformation of the cluster surface. The photoabsorption spec- 
tra were calculated within the framework of deformed jellium model using 
either the plasmon pole approximation [83, 84] or the local density approx- 
imation [85-89]. 

In [31,32] both the adiabatic and non-adiabatic linewidths of electron 
excitations in the vicinity of the plasmon resonance caused by coupling of 
electrons with various ionic vibration modes have been calculated. The 
non-adiabatic linewidths characterize the real lifetimes of cluster electron 
excitations. Naturally, the non-adiabatic widths turn out to be much smaller 
than the adiabatic ones due to the slow motion of ions in the cluster. How- 
ever, the adiabatic linewidths do not completely mask the non-adiabatic 
ones, because the two types of widths manifest themselves differently. The 
adiabatic broadening determines the pattern of the photoabsorption spec- 
trum in the linear regime. The non-adiabatic linewidths are important 
for the processes, in which the real lifetime of electron excitations and the 
electron-ion energy transfer are essential. The information about the non- 
adiabatic electron-phonon interactions in clusters is necessary for the de- 
scription of electron inelastic scattering on clusters [15-17,19], including the 
processes of electron attachment [20,24,26], the non-linear photo-absorption 
and bremsstrahlung [21-23,25], the problem of cluster stability and fission. 
The non-adiabatic linewidths determined by the probability of multiphonon 
transitions are also essential for the treatment of the relaxation of electronic 
excitations in clusters and the energy transfer from the excited electrons to 
ions, which occurs after the impact- or photoexcitation of the cluster. 

In [31,32] there was investigated the role of the volume and the surface 
vibrations of the ionic cluster core in the formation of the electron excitation 
linewidths and demonstrated that the volume and surface vibrations pro- 
vide comparable contributions to the adiabatic linewidths, but the surface 
vibrations are much more essential for the non-adiabatic multiphonon tran- 
sitions than the volume ones. Calculations of adiabatic and non-adiabatic 
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linewidths for sodium clusters and analysis of their temperature dependen- 
cies have been performed in [31,32]. 

10 Concluding remarks 

The have considered a number of problems arising in fast and slow electron 
cluster collisions. The choice of these particular problems was greatly influ- 
enced by the experimental efforts undertaken in the field. However, there 
are many more interesting problems in the field, which have been left aside 
in this lecture, but with no doubts deserve also careful theoretical and ex- 
perimental consideration. Concluding this lecture I would like to mention 
some of them: 

• further improvements of jellium model can be incorporated in the- 
ory, which allow accounting for the detailed ionic structure and the 
fragmentation channels of the system in various collision processes; 

• it is interesting to elucidate collective nature of two electron excita- 
tions in a cluster and their manifestation in collision processes; 

• both elastic and inelastic low energy electron-cluster collisions may 
become a good test for the many-body theories and the applicability 
of simple cluster jellium models; 

• it is interesting to study consequences of the cluster deformations in 
various collision processes; 

• molecular dynamics methods can be used for the description of elec- 
tron - phonon coupling and electron excitation relaxation in atomic 
clusters; 

• deeper insight in theory of various collision processes, in which non- 
adiabatic multiphonon transitions play essential role; 

• generalization of the problems and application of the methods de- 
scribed in the lecture to other types of clusters and nonostructures. 

I would like to express my gratitude to the organizers of the Summer School in 
Les Houches for the opportunity to deliver this lecture and also to the Volkswagen 
Foundation, Deutsche Forschungsgemeinschaft and INTAS for the support of this work. 
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1 Introduction 

This overview of energy landscapes will focus on some rigorous properties 
of potential energy surfaces (PES’s), especially the symmetry properties 
of steepest-descent paths, and on thermodynamics and cluster simulation. 
More detail concerning topics such as visualisation of global PES’s and 
master equation dynamics may be found in a complementary review [1] . The 
present notes will also touch upon the Born-Oppenheimer approximation 
[2], tunnelling and global optimisation, but in rather less detail. 

In classical mechanics the potential energy function, V , determines the 
structure, dynamics and thermodynamics of any system. Stable configura- 
tions occur at minima in P, defining the structure; the gradient of V gives 
(minus) the forces on the particles, which appear in the equations of mo- 
tion; and the configuration integral, which determines the thermodynamics, 
is also a function of V. Analogous statements can be made in quantum me- 
chanics within the Born-Oppenheimer approximation (Sect. 2), where the 
potential energy surface (PES) is the solution of the Schrodinger equation 
for frozen nuclear positions. 

When we are interested in the structure and physical properties of a 
chemical system we often mean the properties of the global minimum on the 
PES, or the properties at zero Kelvin. However, the dynamics of a system 
at temperatures or energies where it can escape from the global minimum 
depend on larger regions of the surface. The topology and topography of 
these extended regions determine the precise behaviour. 

When considering the wider features of the PES in this way, it has 
become common to refer to it as the “potential energy landscape”, or just 
the “energy landscape” . The latter expression is also used to refer to free 
energy, though this should be clear from the context. 

As the size of the system increases, so does the dimensionality of the 
configuration space required to describe it (Sect. 3.1). In the late 1960’s the 
first attempts were being made to predict the three-dimensional structure 

© EDP Sciences, Springer- Verlag 2001 
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of globular proteins from their amino acid sequence. In 1964 Anfinsen had 
shown that some denatured proteins regained their native structure reli- 
ably on a laboratory time scale [3]. By coarse-graining configuration space, 
Levinthal realised that the number of possible conformations for a typical 
protein is astronomically large [4] . If they were searched at random on the 
time scale of a typical vibration the time required to find the native state 
would probably exceed the lifetime of the universe. For example, with just 
two conformations per amino acid residue, searched at a rate of 10^^ s“^, 
it would take 2^™ x 10“^^ s to visit them all for a 100 residue protein, i.e. 
around 4 X 10^° years. This discrepancy has come to be known as Levinthal’s 
paradox. It has now been largely resolved from a better understanding of 
energy landscapes [1]. 



1.1 Levinthal's paradox 



The first attempts to address the paradox centred on the idea that not all 
the possible configurations have to be searched. Levinthal himself suggested 
that folding might occur via a specific pathway, initiated by the formation 
of a condensed nucleus and proceeding through a well-defined sequence of 
events [5]. Alternatively, it was also suggested that different regions of 
secondary structure could start to form simultaneously or that the initial 
random coil might collapse to a compact state or molten globule, which 
would then need to rearrange. Simulations of simple lattice models suggest 
that the initial collapse is indeed fast and insensitive to the residue sequence 
[6,7]. However, some proteins, such as cytochrome c, do not appear to follow 
this route [8]. 

It seems unlikely that a reduction in the search space is the solution 
to the paradox, since the number of states remaining would still be too 
large for a random search to succeed. However, such ideas also contain the 
implicit notion that, in some way, the search is not random. In terms of the 
energy landscape there are two reasons for this. Firstly, conformations have 
different statistical weights in the thermodynamic ensemble, and secondly, 
they are not arranged at random in configuration space. 

Levinthal’s analysis assumes that the energy landscape is fiat, like a golf 
course with a single hole corresponding to the native state (Fig. 1) [9]. For 
a simple model that includes an energetic bias towards the native structure 
the search time can be dramatically reduced to physically meaningful scales 
[ 10 , 11 ]. 
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Fig. 1. Flat (left) and biased (right) golf courses. 



Leopold, Montal, and Onuchic subsequently proposed that the landscape 
of a naturally occurring protein consists of a collection of convergent kinetic 
pathways that lead to a unique thermodynamically stable native state [12]. 
This landscape structure was termed a folding “funnel” because it focuses 
the manifold of misfolded states towards the correct target. The random 
search element of Levinthal’s “paradox” is simply removed. In fact, we could 
ask similar questions about crystallisation, which may be an even more 
statistically improbable event, and how it is that buckminsterfullerene [13] 
is selected from a mass of vaporised small carbon chains when there are vast 
numbers of other possible isomers (Fig. 2)? 



Ceo 





Fig. 2. Schematic time of flight mass spectra for carbon clusters prepared by laser 
vaporisation at high (left) and medium (right) buffer gas pressures. 

Free energy funnels for proteins, which incorporate the entropy, have 
now been investigated for a variety of models [9, 14-16]. To do this it is 
necessary to express the free energy (usually Helmholtz) in terms of or- 
der parameters. The free energy landscape depends upon temperature and 
mass - the potential energy surface does not. 

Often the dependence of the free energy, A, is expressed in terms of 
a single order parameter, Q, which is designed to somehow measure the 
similarity of the protein configuration to the native state. The function 
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A{Q) 





Q 
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Fig. 3. Schematic protein free energy surface as a function of the radius of gy- 
ration, Rg, and the number of native contacts, Q. N = native state, C = “ran- 
dom coil” , G = “molten globule” . 

A{Q) may exhibit a double minimum over a particular temperature range, 
allowing a thermodynamic “transition state” to be defined between folded 
and unfolded states (Fig. 3). Some simulations suggest that the folding 
ability can be measured by the ratio of the folding temperature, Tf, below 
which the native state has the lowest free energy, to the “glass” transition 
temperature, Tg, where the kinetics exhibit a dramatic slowing down [7,17]. 
Folding is easiest for large TfjTg, since the native state is then statistically 
populated at temperatures where it is kinetically accessible. An important 
research goal is to understand why some amino acid sequences fold rapidly 
and reliably whilst others do not. 

For a simple lattice model it was suggested that the necessary and suffi- 
cient condition for folding was a global potential energy minimum lying well 
below the other local minima [6,18]. This situation certainly increases the 
value of Tf , but the strength of the correlation between efficient folding and 
the energy gap separating the global minimum from the next lowest state 
has been questioned [19,20]. Relaxation from high-lying states must also de- 
pend on global properties of the potential energy surface, not just the region 
around the global minimum [21]. In fact, surfaces exist with pronounced 
potential energy minima that are kinetically inaccessible (Sect. 3) [1]. 

The energy of a protein depends on interactions between parts of the 
chain that come into contact as it folds. If the favourable interactions 
that define the native state involve bringing together other residues that 
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interact unfavourably, the process is energetically frustrated. Geometric 
frustration arises if two similar structures can only be interconverted by 
disrupting contacts that have been made elsewhere. Both these effects lead 
to roughening of the potential energy landscape. Proteins that avoid these 
problems as far as possible are said to obey the principle of “minimum 
frustration” [9]. 

It is now generally believed, or at least assumed, that the native state of 
a protein is the global free energy minimum at the appropriate temperature 
(Anfinsen’s thermodynamic principle) [9,22]. To the extent that protein 
folding is a search for the global potential energy minimum, it is an example 
of the more general problem of global optimisation (Sect. 8 ), which is of 
great interest in fields as different as economics and microelectronics. The 
success of any global optimisation algorithm is inextricably linked to the 
structure of the landscape, and an understanding of these connections has 
recently been exploited to design more effective techniques (Sect. 8 ) [23-25]. 



1.2 “Strong" and “fragile" liquids 

The potential energy landscape also plays an important role in determining 
the behaviour of both finite and bulk matter in the liquid state. Angell’s 
scheme classifies liquids on a scale from “strong” to “fragile” using both 
thermodynamic criteria, such as the entropy relative to the crystal, and 
dynamic criteria, such as the viscosity [26]. Dynamic criteria are generally 
related to a reorganisation or structural relaxation time scale. In these 
terms strong liquids are more “resistant” to structural change as a function 
of temperature. 

A strong liquid is characterised by a diffusion constant and viscosity 
that follow the Arrhenius form (oc exp[A/T]). These are often liquids with 
strongly bonded open network structures like Si 02 , BeF 2 and P 2 O 5 . Fragile 
liquids exhibit non- Arrhenius behaviour and tend to have more isotropic 
interactions (Fig. 4). Quantitative measures of strength and fragility have 
also been suggested [27]. 

Strong materials also have small heat capacity differences between the 
supercooled liquid and crystal; fragile materials exhibit large heat capacity 
differences. The apparent violation of the third law of thermodynamics by 
the extrapolated entropy of a supercooled liquid is known as the Kauzmann 
entropy paradox [28]. This possibility is avoided by the intervention of the 
glass transition where the viscosity increases rapidly and liquid-like diffusion 
ceases. 

Whether we consider the viscosity or a general relaxation time, t, the 
temperature dependence can often be described over a certain range by the 
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Fig. 4. Schematic behaviour of strong and fragile liquid viscosities. 




T/T„ 



Fig. 5. Variation of the excess entropy with T /Tm for some strong and fragile 
glass formers. 

empirical Vogel-Tammann-Fulcher (VTF) equation [29-31]: 



T = To exp[DTo/(T - Tq)], 



( 1 . 1 ) 
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where B, D and Tq are fitted parameters. Arrhenius behaviour is recovered 
for small To and large D. An alternative expression due to Ferry et al. often 
works equally well [32]: 



For strong liquids the relaxation behaviour of the property described by r 
is usually found to fit a simple exponential form, p{t) = p(0)e“*/'^, where t 
is the time. For fragile liquids a variety of alternative functional forms have 
been suggested, such as the Kohlrausch-Williams- Watts stretched expo- 
nential [33], p{t) = p(0) exp[— at®]. Debye relaxation corresponds to 0 = 1. 
Although the relation is by no means universal, increasing fragility is corre- 
lated with departure from Debye relaxation in a wide variety of liquids [34] . 
To define a general relaxation time we can use [35] 



which gives the expected result for Debye relaxation. 

As the temperature is lowered the absorption spectrum of a supercooled 
liquid or glass corresponding to dielectric relaxation often exhibits a peak 
that moves to lower frequency as the temperature decreases [36] . Once this 
primary or a peak has disappeared a weaker (3 absorption may be revealed 
which in turn moves to lower frequency as the temperature decreases. The 
a peak frequency as a function of temperature often exhibits non- Arrhenius 
behaviour and has been associated with slower molecular rearrangements. 

Adam and Gibbs provided a heuristic connection between thermody- 
namics and dynamics by considering “cooperative relaxation” to obtain [37] 



where A is a constant, is the free energy barrier per molecule in the 
cooperative group and Sc is the excess configurational entropy of the liquid 
over the crystal. The VTF equation (1.1) can be recovered if Cpjiquid — 



In fact it is not always possible to classify liquids unambiguously as 
strong or fragile, and some strong liquids become fragile at higher densities. 
For example, the diffusion of both Si and O atoms in silica becomes strongly 
non- Arrhenius when the density is increased by 30% [38]. 

Glasses often (not always) exhibit other characteristic features such as a 
low temperature heat capacity that varies linearly with T. This phenomenon 
may be due to quantum mechanical tunnelling (Sect. 5) between nearly 
degenerate minima separated by tiny barriers [39-41]. Many glasses also 
exhibit an excess vibrational density of states at low frequency compared to 



T = roexp[-T)/T^]. 



( 1 . 2 ) 




(1.3) 



T = To exp(AA/i/TS'c), 



(1.4) 
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the usual Debye dependence {g{v) oc v'^). This feature is termed the “boson 
peak” . 

Much has been measured and fitted in the field of supercooled liquids and 
glasses. However, the fundamental basis for all glass phenomenology must 
lie in the underlying PES, as Goldstein realised as early as 1969 [42]. For 
example, Angell has proposed that fragile liquids have more local minima in 
a given energy range to account for the larger heat capacity, with low bar- 
riers between them to give a low value of D in the VTF equation (1.1) [38]. 
The slow a and fast (3 processes are generally presumed to be associated 
with mechanisms involving larger and smaller barriers [35] . 



2 The Born— Oppenheimer approximation 



The Born-Oppenheimer approximation is a key element in the notion of a 
potential energy surface [2] . The Schrodinger equation for a molecule with 
n electrons, mass We, and N nuclei, masses Mg, is 



N 



-E 






2M. 



li 

^ 2m, 
1 — 1 



-Vf + V(x,X) 



V-(x,X) = Fe,actV^(x,X), (2.1) 



where x and X represent the electronic and nuclear coordinates. The po- 
tential energy is 



P(x,X) 




( 2 . 2 ) 



where Zt is the atomic number (nuclear charge) of nucleus t and e is the 
modulus of the electronic charge, ra = [x^ — Xt| where x^ and Xj are the 
position vectors of electron i and nucleus t, etc. 

It is more convenient to use atomic units where e = 1, rUe = 1, 47reo = 1 
and Ti = 1. The units of energy and length are then the hartree, Ah = 
jAireoao, and the Bohr radius, ag = /mee^. 

We denote the total Hamiltonian operator in equation (2.1) by li, and 
the nuclear kinetic energy operator (the first term in Eq. (2.1)) by Tn- The 
Born-Oppenheimer electronic wavefunction is defined as the solution of 

{n - V-e = (2.3) 

ifie is a function of the electronic coordinates x, but only depends upon 
the nuclear positions X parametrically, because equation (2.3) is solved for 
some particular nuclear geometry. Hence we write '0e(x;X) and Fe(X) to 
show that different electronic wavefunctions and energies are obtained for 
different atomic structures. 
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The potential energy surface defines the variation of the electronic en- 
ergy, Ee(X), with the nuclear geometry. In a diatomic molecule we simply 
consider Ee{R) as the bond-length, R, varies. 

Assuming that Ee(X) defines an effective potential in which the nuclei 
move, the appropriate Schrodinger equation for the nuclear wavefunction, 

V'n(X), is 

+ Ee(X)) ^„(X) = EBoV'n(X). (2.4) 

We now write '0(x, X) = '0e(x; X)'0n(X), substitute this into equation (2.1) 
and neglect all the terms involving derivatives of ^/)e(x;X) with respect to 
nuclear coordinates, i.e. ' 0 nV(' 0 e and Vt' 0 n • Vt'i/'e- A classical mechanical 
justification for this approximation is that the ratio of the mass of a proton 
to the mass of an electron is about 1836. Hence, nuclear velocities are ex- 
pected to be very small in comparison with electronic velocities. Separating 
the variables gives (divide by i/;e(x; X)^/>„(X)) 



^ V^V;e(x;X) 

2^;e(x;X) 



l/(x,X) 



VtVn(X) 



+ Ebo 



E,{X). 



(2.5) 



Hence we recover equations (2.3) and (2.4). The key point is that the latter 
two equations are much more tractable than equation ( 2 . 1 ). 

Potential energy surfaces only exist within the Born-Oppenheimer ap- 
proximation. If the approximation were exact then H— D would have no 
dipole moment, because the extra neutron in the frozen deuterium nucleus 
would not affect the electrons. This is why we usually assume that the (elec- 
tronic) potential energy curve (or the effective force constant) of a diatomic 
molecule is unchanged by isotopic substitution. In fact H— D has a dipole 
moment of order 10“^ D (1 Debye = 3.336 x 10“^° Cm); for comparison, 
CH 3 F has a dipole moment of 1.85 D. Hence the approximation is very good 
for H— D, and even better for heavier atoms. 



2.1 Normal modes 

Various approaches to the nuclear dynamics problem summarised in 
equation (2.4) are commonly employed. The simplest is normal mode anal- 
ysis, which requires an orthogonal transformation of coordinates. 

2.1.1 Orthogonal transformations 

In general, we interpret the elements of an n-dimensional vector, v = 
(vi, V 2 , ■ ■ ■ , Vn) as the components of a vector quantity in a particular basis 
of orthonormal {i.e. orthogonal and normalised) unit vectors, Gq,, defined 
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by 



duj 



dr 



dUn 



( 2 . 6 ) 



where r is a position vector that depends upon the coordinate Ua- We must 
distinguish between the components, denoted by the row matrix v, and the 
vector 

n 

V = ^VaBa. ( 2 . 7 ) 

Of— 1 

If we change basis to a new set of orthonormal vectors, e(j,, where 



then 



So that 



3q, — 'y ] Cfja G/3 J 

/3=1 

n n 

v' = ^ v'^CfBaep. 

ct,/3— 1 



OL — 1 



Vf3 



= E<C'/3o 



or 



V = v' C^, 



( 2 . 8 ) 



( 2 . 9 ) 



where superscript T denotes the transpose. The transformation matrix C 
therefore relates the components of a vector quantity in different bases. 

If the components of v and v' correspond to different coordinates then 
the functions /(v) = /(v'C^) must have the same value at any point. 
For example, the potential energy does not depend upon the coordinates, 
but the functional form may be different. If V{x,y) = + 2y^ and u = 



+ y)/V2-. 


, v={y 


— x)j then 






(m,u) 


= (x,y)C, where ^ 


, — 1 , — 1 

1 

1 — I 1 — 1 




V{u,v) 


= 3(u^ + u^)/2 + MU, 






s{x,y) 


= iy^V(x,y),^yV{x,y)) = 


(2x,4y) 


and 


i{x,y) 


= 2xGx+ ^yey, 






g{u,v) 


= (V„1/(m,u),V„I/(m,m)) = 


(3m + u, m + 3m) 


and 


i{u,v) 


= (3m + u)e„ + (m + 3u)ei,. 





( 2 . 10 ) 



The components of (x, y)C are the values of the new coordinates (u, v) and 
the components of the gradient vectors are related by g{x,y) C = g{u,v), 
where Cap = Bq, • e^. The unit vectors in the (u,v) coordinate system are 
defined by 
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dr 



dr 

dv 



dx , 



dv^'^ 



dr 

dv 




( 2 . 11 ) 



where r = xe^ + The gradient at any point is unaffected by the 

transformation: 



i{u,v) = {^u + v)eu + {u+‘iv)ey=2xe^+Ayey = i{x,y), (2.12) 

but it is important to notice that the components of g(u,v) are different 
from those of g(x, y). 

For transformations between orthonormal basis sets C is an orthogonal 
matrix and distances and scalar products are conserved. These results follow 
immediately from the conservation of the scalar product: 

= (Cv;)^(Cv') = (V1)^(V2). (2.13) 



2.1.2 The normal mode transformation 

For motion near a local minimum we expand Fe(X) in a Taylor series: 

ife(X) « Fe(X°) + i £ g^(X„ - Xl){Xp - X°), (2.14) 

ct,/3=l “ ^ 

where the first derivatives vanish for equilibrium at X°. 

A normal coordinate analysis then defines the fundamental vibrational 
modes of the molecule. Within the harmonic approximation the nuclear 
Hamiltonian in equation (2.4) can be written: 

+ Fe(X) = - ^ ^ + (X - X°)^H(X'’)(X - X°), (2.15) 

t=i ^ ‘ 

where 7 Lq,/3(X°) = d'^ Ee{X.^) / dX adX and we have chosen the energy 
origin such that Fe(X'’) = 0. 

Transforming to mass- weighted coordinates qt = X(-\/Mt, where Xj is 
the position vector of nucleus t, simplifies the kinetic energy operator to 
give 



1 1 _ 

T„ + Fe(q) « - ^ + -(q - q°)^ H(q°) (q - q°), (2.16) 

where iLa/?(q°) = d'^Ee{a^) j dqadqp, or Hay = Hap/ y/MaMy where 
is the nuclear mass corresponding to the component qa- Both H and H are 
commonly referred to as Hessian or second derivative matrices. 
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The symmetric 3N x 3N matrix H can be diagonalised using a matrix, 
A, whose columns are the eigenvectors of H: 

3N 

^ ^ a a (3 -^I3'y ~ ^7 ^CK7- 
13=1 



with (3 running from 1 to 3A corresponds to the 7 ’th eigenvector of H. 
Since H is symmetric A is orthogonal and AA^ = I or ^apAj^ = Sa-y 
and Aj^o^Aj^^ — ^a 7 - 

We now define new coordinates, Q^, by the orthogonal transformation 

3N 3N 

Qa — ^ ^ A(^y Q 7 ; SO that Q (3 — ^ ^ Aq/^ Qa- (2.18) 

7 = 1 CK=1 



In terms of these new variables and moving the origin to the stationary 
point in question: 



^ OiV 

9 ^ ' Hap Qa Qp 

a,P=l 



1 V — 

2 dql 



1 



3N 



cy ^ ^ Hap Aa^ Ap^ 
q:,/ 3,7,(5=1 
3iV 



3iV 



^ ^ -^a7 -^aS Qj Qs — ^ ^ ^ Qs' 



ot,'y,6=l 
3N 



5=1 



E 






,/3,7=1 ^Qp^Ql 



■ Aan A, 



q:/3 -^0:7 



^ 3N 






(2.19) 



The overall transformation Qp = Y^^yAapXaV^a therefore reduces 
the Hamiltonian to an uncoupled sum of 3A simple harmonic oscillator 
Hamiltonians, one for each normal coordinate: 



Tn + A'e(Q) 



1 

2 



3N 



E 




(2.20) 



For such a separable Hamiltonian the wavefunction is a simple product and 
the energy is a simple sum. The corresponding classical Hamiltonian is 



, 3N 

2 E Qa I 



( 2 . 21 ) 
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where Qa denotes the time derivative. The solutions have the form 

= CjCOs{ujjt + e^) , ( 2 . 22 ) 

and the general solution for the motion of any nuclear coordinate is 

3N 3N 

Xa = ^ Aa^ Q^j Ma — E Cry COS {ujryt T Cry) /V^. (2.23) 

7=1 7=1 

Motion corresponding to a single coordinate Qr^ is called a “normal mode” . 
The displacements Xa oscillate in phase with frequency and relative am- 
plitudes Aa-y/ y/Ma- A general vibration is a superposition of all 37V modes. 
uj may be thought of as an angular frequency, related to a conventional fre- 
quency, by a; = 2ttu. uA also defines the curvature (second derivative) 
of the corresponding normal coordinate at any point. The normal mode 
approach is straightforward but limited by the validity of the second order 
Taylor expansion of the potential energy. 

3 Describing the potential energy landscape 

3.1 Introduction 

For an Watom system the potential energy is a 3Wdimensional function. 
To refer to a potential energy hypersurface we must embed the function in 
a 3N + 1 dimensional space where the extra dimension corresponds to the 
“height” of the surface. The main problem with treating many-dimensional 
potential energy surfaces is that the number of stationary points grows 
rapidly with size. For a system composed of Na atoms of type A, A^b atoms 
of type B, etc., the Hamiltonian is invariant to all permutations of equivalent 
nuclei and to inversion of all coordinates through a space- fixed origin. The 
number of permutation-inversion isomers of any given configuration could 
therefore be as large as 



2 X TVa! X TVb! X TVc! X •••, (3.1) 

but is reduced if the system possesses any point group symmetry elements 
other than the identity. 

Following Pechukas [43] we associate any point group operation of a nu- 
clear configuration X with a matrix C for which C X = X. For proper 
point group operations (pure rotations and the identity) C can be written 
as a product of matrices associated with the mapping induced by the cor- 
responding rotation, R, and the permutation of nuclear labels required to 
reorder the coordinate vector, P. We can show that any rotation or reflec- 
tion commutes with any permutation by considering how these operations 
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act on the atomic position vectors, X^, within the 3-/V-dimensional vector 
X. For example: 





/ Xi \ 




f ^ 




/ Xi \ 




( XP(1) \ 




X2 




rX2 




X2 




Xp(2) 


R 




= 




and P 




= 




1 j 




iv i-Xa J 




1 j 




V Xp(jv) J 



where P{i) defines the mapping of atomic labels under the permutation 
operation, and r is the 3x3 transformation matrix corresponding to the 
proper operation. Hence 



/ 

RPX=PRX= 



rXp(i) 

rXp(2) 






V rXp(AT) / 



(3.3) 



and if RPX = X then rXp(o,) = Xq,. For such configurations the per- 
mutation corresponding to P X produces a labelled structure that can be 
superimposed on X by the pure rotation associated with the matrix r“^. 

For improper point group operations we have CX = PSX = X where 
the matrix corresponding to the improper symmetry element can be written 
as S = E*R with R a pure rotation and E* the inversion of the nuclear 
coordinates through the origin, i.e. E* = —I. Note that R need not corre- 
spond to a point group operation. Hence P E* R X = X and P E* produces 
a labelled structure that can be superimposed on X by the pure rotation 
associated with r"^. 

For a (non-linear) nuclear configuration with point group of order h 
there are therefore h permutation and permutation-inversion operations 
that leave the configuration X unchanged, aside from possible reorienta- 
tions. The number of distinct points or “permutational isomers” of X is 
therefore nx = 27 Va!A^b!A"c! ■ ■ ■ /h. The ratio of the number of permuta- 
tional isomers of any two configurations of the same molecule, A and B, 
is therefore n^/uB = /le/h-A- The permutation and permutation-inversion 
operations defined above form a group, known as the rigid molecule 

group [44]. The isomorphism between and the usual rigid molecule 
point group, Qp^, is established by the explicit construction given above, 
and the elements of may be obtained in general from the following 
prescription: 



• for each proper operation, TZ, in Qp^ there is a permutation in 
corresponding to the mapping of labelled atoms defined by the action 
of TZ on the rigid molecule considered as a macroscopic object; 
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• for each improper operation, S, in there is a permutation-inversion 
in where the permutation corresponds to the mapping of labelled 
atoms defined this time by the action of S. 

For example, all planar molecules have a CTh reflection element that corre- 
sponds to the inversion, E* , combined with the identity permutation, so 
^RM contains E* . 

In the above analysis we have considered rotations and reflections that 
act on the nuclear framework as a rigid macroscopic object. This is the 
way that the point group is deduced. However, when the symmetries of 
vibrational and electronic states are assigned it is conventional to define 
point group operations that act upon vibrational displacements and elec- 
tronic coordinates relative to a molecule fixed axis system [44,45]. Bunker 
and Jensen distinguish the group composed of the latter operations from 
Qp^. and call it the “molecular point group” [44]. In fact, it was Hougen who 
first established that the rotational energy levels of nonlinear molecules can 
be classified according to irreducible representations of the full molecular 
point group [46,47]. 

There are no strict rules for the growth in the number of stationary 
points corresponding to different structures with system size. The evolution 
is system dependent, but empirical observations [48] and simple theories 
[49-51] suggest an exponential increase in the number of structurally distinct 
minima. Multiplying by the permutational factor therefore suggests that the 
number of minima or transition states for a homonuclear system is likely to 
increase as 

21V! exp (oTV'') , (3.4) 

where a > 0 and b is of order unity. 

In fact it is possible to derive relations between the numbers of stationary 
points of different Hessian indices, the Morse rules [52], and upper and lower 
bounds for the number of stationary points of a given index can sometimes 
be defined [53]. Topological analysis of potential energy surfaces has been 
discussed in some depth by Mezey [54-56]. 

4 Stationary points and pathways 

The most interesting points of a potential energy surface are stationary 
(or critical) points where the gradient vanishes. Non-linear molecules in 
field-free space have six zero normal mode frequencies at a stationary point, 
corresponding to overall translations and rotations that do not affect the 
energy [57]; linear molecules have five. It is worth investigating the origin 
of these zero eigenvalues properly to appreciate the fundamental difference 
between the translational and rotational degrees of freedom [45] . 
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4.1 Zero Hessian eigenvalues 

The potential energy must be invariant to an overall displacement of any 
magnitude, A, in the x direction, which we denote by A where 



f 1 \ 
0 




V : / 



is the unit vector in the x direction. Taylor expansion gives 
C(X°) = V{X° + Ae,) 

= y(X°) + A g(X°)^ e, + H(X'’) e, + . . . , 

where g(X°) = VF(X°), or gc(X°) = 9C(X°)/5X„, and H^f 3 {X°) = 
d^V{X°)/dX^dX/s. 

The terms in each power of A must vanish for any X*^, so 

g(X°fe, = 0, and CH(X°)e, = 0. (4.2) 

The first condition gives a conservation law for the x component of the 
centre of mass linear momentum, and the second implies that e^; is an 
eigenvector of H with eigenvalue zero. The corresponding eigenvector of 
Ha/3 = Hap/ yjMaMp is therefore (VMi, 0, 0, VmJ, 0 . . .)/>/M, where M 
is the total mass, and gives the components of the corresponding normal 
mode, in terms of the mass-weighted coordinates qa- Hence 

Qtrans _ jVfi q\x + \/ M2 q 2 x \ZMjv qNx^ / '/M , 

= {MiXi + M 2 X 2 + 1- MmXn) / Vm, (4.3) 

where Xt is the x coordinate of atom t. is proportional to the x coordi- 

nate of the centre of mass; the factor of 1 /VM appears due to normalisation. 
Analogous results apply for translation in the y and z directions. 

Overall rotations, however, are more complicated. First we need the 
general formula for the rotation of the position vector of atom t, X/, through 
an angle a about an axis defined by the unit vector n: 

Xt = X° cos a -I- n (h • X°) (1 — cos a) + X° A n sin a. 



(4.4) 
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The displacement vector for small |a| and n = (1, 0, 0) for a rotation about 
the X axis becomes 

Xt-X° = -y(X?-h(n-X°))+aX?Ah 

= ^ 0 , azt - -ayt ~ > ( 4 - 5 ) 

where is the z component of the position vector of atom t in the reference 
position, etc. 

Denoting the SWdimensional displacement corresponding to overall 
rotation about the x axis by 



we now obtain 

E(X°) = F(X° + R,) = E(X°) + g(X°)^ R, + IrJ H(X°) R, + . . . 

Equating terms in a to zero gives a conservation law for the component of 
angular momentum about the x axis: 



azl - \c?y\ 
-ay\ - \a^z\ 



az2 — 

-aV2 - 



(4.6) 



/ 0 \ 



g(X^) 



0\T 



z? 

-v\ 

0 

-vl 



V : / 



= 0 , 



(4.7) 



or [ViE(X°)AX?]^+ [V 2 E(X°)AX°]^ + ... = 0, 



where the left-hand-side is the x component of the torque on the system. 

At a general point we cannot equate the terms in o? to zero and ob- 
tain an eigenvector of the second derivative matrix because there are also 
terms in the gradient. If the gradient does not vanish then overall rotations 
are coupled to the intramolecular vibrations and we obtain no further zero 



456 



Atomic Clusters and Nanoparticles 



eigenvalues. However, at a stationary point 



( 0 \ 



H(X°) 



0 

4 

- 2 / 2 ° 



= 0 . 



V : / 



(4.8) 



We therefore deduce that 

QT = (V^(z?giy - y?giz) + V^(z2'72j/ - y2'?2^) ■ /V^, 

= (Ml [z^yi - ylzi) + M 2 {z^y2 - ^ 2 ^ 2 ) ■ • ■) / (4-9) 



is a normal mode with zero eigenvalue, and analogous arguments apply to 
rotations about the y and z axes. The diagonal component of the iner- 
tia tensor, appears due to normalisation. The above translational and 
rotational normal mode vectors are orthonormal if the molecule fixed coor- 
dinate axes coincide with the principal axes and the centre of mass lies at 
the origin. 

The accuracy of the “zero” eigenvalues can be a useful measure of how 
well a geometry optimisation has converged. 



4.2 Classification of stationary points 

The characteristics of any stationary point are determined by the corre- 
sponding Hessian. The number of negative Hessian eigenvalues is termed 
the Hessian index, and this is usually the same as the order defined as 
the number of mutually orthogonal directions in which the energy is maxi- 
mal [58]. Minima are stationary points with no negative Hessian eigenvalues. 
Note that a negative Hessian eigenvalue also corresponds to a negative force 
constant and an imaginary normal mode frequency. A displacement along 
such a coordinate lowers the energy. 

The Murrell-Laidler theorem [59] defines a transition state as a station- 
ary point with a single imaginary normal mode frequency. Consider an index 
two saddle with two negative force constants, ki and /c 2 , for normal modes 
Qi and Q 2 - With all the other coordinates fixed the change in potential 
energy for small displacements away from the saddle is: 

AV = 2(^1 Qi 4 ” ^2 Q2) 9 . 



(4.10) 
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Hence any displacement in this space lowers the energy. The saddle is 
therefore a “hill” in these two dimensions and we can always find a lower 
energy path by walking around it (Fig. 6). 




Fig. 6. Surface and contour plots illustrating the Murrell-Laidler theorem. M, 
TS and S denote minima, transition states and the index 2 saddle, respectively. 

Hence if there is a path connecting two minima via a saddle point of 
index greater than one then it is (almost) always possible to find a lower 
energy path via one or more true transition states. Rate theory is therefore 
usually concerned with passage over true transition states. 



4.3 Pathways 

We can define a steepest-descent path or gradient line, X(s), in terms of a 
parameter s such that |dX(s)/ds| = 1. X(s) as a function of s defines a 
continuous path through nuclear configuration space. Furthermore, 

X(so + ds) = X(so) + ^^^ds + . . . (4.11) 

ds 



Hence, for ds — > 0 we have X(sq + ds) — X(sq) = (dX(so)/ds)ds and the 
corresponding path length is |dX(so)/ds|ds. Since we have defined s such 
that |dX(so)/ds| = 1 the integrated path length between X(si) and X(s 2 ) 
is 



rS2 


dX(s) 


L 


ds 



ds 



= 32 - Si. 



(4.12) 



The parameter s therefore corresponds to the path length. 

For a steepest-descent path the tangent to the curve must be antipar- 
allel to the gradient vector, and the constraint |dX(so)/ds| = 1 defines the 
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constant of proportionality: 



dX g(X) 

ds |g(X)f 



(4.13) 



The minus sign ensures that the potential energy decreases with s. 



4.4 Properties of steepest-descent pathways 

When a reaction path is defined in terms of the steepest-descent 
equation (4.13) a number of rigorous conditions can be deduced about 
its symmetry properties [43,60"63]. These geometrical symmetry selection 
rules complement the orbital symmetry results of a Woodward-Hoffmann 
analysis [64] . The geometrical rules must be rigorously obeyed by a reaction 
path, whereas “forbiddenness” in the Woodward-Hoffmann sense implies a 
large but finite barrier, and can be overcome. 

4.4.1 Uniqueness 

If we assume that all the first and second derivatives of U(X) exist and 
are continuous then every point in the nuclear configuration space, X, lies 
on one and only one solution of (4.13), unless the gradient also vanishes. 
This result follows immediately from the uniqueness theorem of first order 
ordinary differential equations. Suppose we have two different paths, X(s) 
and X^(s), which coincide at some point sq. Then at this point we have 

dX(so) g(X(so)) dX^(so) 

ds |g(X(so))| ds ■ ^ ^ 

Hence the curves are coincident: 

X(so+ds) = X(so) + ^^^ds = X'(so) + 

ds ds 

= X'(so + ds). (4.15) 

A steepest-descent path is therefore uniquely specified by a single point that 
lies on it. This result is not true at a stationary point where |g(X)| = 0 
and equation (4.13) becomes undefined. In general there are infinitely many 
solutions to (4.13) that terminate at any given minimum. 

4.4.2 Steepest-descent paths from a transition state 

We will now prove that for a true transition state there are only two solutions 
to (4.13) with decreasing potential energy starting at Xts- These paths 
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set off in opposite directions along the eigenvector of H corresponding to 
negative curvature. 

Consider an orthogonal transformation using the matrix B that satisfies 

3N 

= eliBa-j. (4.16) 

(3=1 



Bfj^ with (j running from 1 to iN corresponds to the y’th eigenvector 
of H, just as corresponds to the y’th eigenvector of H. Now define 
new coordinates Wa = S/ 3 =i Since B is an orthogonal matrix 

Xa = Ba/sWjs. The potential energy in the Taylor expansion about 

a stationary point at the origin becomes 



^ i H^pB^^BpsW^Ws (4.17) 

ct,/3,'y,6—l 

^ 3N 3A^ 

= - y] B^^elB^sW^Ws =-Y,elWl 



q:,/3=1 



Q!,7,(5=1 



5=1 



^ 3N ^ 

Since Bap 

/3=1 



dX, 



dWp" 



(4.18) 



the steepest-descent equation (4.13) at the stationary point in these new 
coordinates is equivalent to the ?>N equations: 



dW„ 






ds |g| 

In fact, the same paths are generated by 

dW„ 



where du = ds/|g|, because if W(so) = W(mq) then 

bEc((so + ds) = Wq,(so) H ^ ds 4 

ds 

tWg(so)ds 

|g| 

dWo,(Mo) ds 



(4.19) 



(4.20) 



= WM- 



— Wa{uo) 4- , .... 

du |g| 

= Wo,(Mo4-dM) for |ds|, |du| <C 1. 



(4.21) 
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Hence the solutions close to the stationary point (at W = 0) can be written 
in parametric form as [43] 

W^iu) = hh„(0) exp(-e». (4.22) 

Now suppose that the stationary point corresponding to W = 0 is a tran- 
sition state, with the unique negative Hessian eigenvalue (or curvature). 
There are only two independent paths that approach the transition state 
as u ^ — oo: they have Wa = 0 for a yf 1 and Wi(0) either positive or 
negative. An infinite number of steepest-descent paths approach the tran- 
sition state as u ^ oo, all with W\ = 0. The energy rises if we step off 
the transition state in any of these directions. There are therefore only two 
distinct downhill steepest-descent paths leading from the transition state, 
and they begin parallel and antiparallel to the eigenvector corresponding to 
the unique negative Hessian eigenvalue (Fig. 7). 




Fig. 7 . Steepest descent paths (thick) and contours (thin) for the potential energy 
surface V = —x^ + 2y^ . 
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In fact, further restrictions are needed to guarantee that there are only 
two steepest-descent paths leading down from a transition state. Zero 
Hessian eigenvalues other than those corresponding to overall translation 
and rotation can lead to all sorts of interesting surfaces. For example, a 
“monkey saddle” can occur for a cubic surface, as shown in Figure 8. The 
two-dimensional surface defined hy V = —ax^ — (iy^ with a, f3 > 0 has 
Hessian eigenvalues — 2o! and 0 at a; = y = 0, but the energy is not in- 
variant to displacements along the eigenvector corresponding to the zero 
eigenvalue [43]. The steepest descent paths are 

a;(u) = x(0)e^““, y(u) = 1/ (l/y(0) - 3/3 m) , (4.23) 

and an infinite number of steepest-descent paths approach the origin as 
u —> —oo in this case [43]. Real systems generally do not exhibit such 
exotic features: the appearance of an additional zero Hessian eigenvalue at 
a point where the gradient also vanishes exactly would be an improbable 
numerical accident. 




Fig. 8. Contour and surface plot for the monkey saddle exhibited by the function 
— 3x^y. 

There is also nothing to prevent one or both the downhill steepest- 
descent paths starting at a transition state from terminating at a station- 
ary point that is not a minimum. Such features are probably quite com- 
mon [65-70]; the addition of HF to ethene (Fig. 9) is an example [71]. A 
steepest-descent path links transition state 1 to the eclipsed form of fluo- 
roethane, which is the transition state for internal rotation. 

4.4.3 Principal directions 

For any stationary point there are 2{3N — 6) principal steepest-descent 
paths corresponding to positive and negative displacements along the 3N—6 
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transition state 1 transition state 2 minimum 



Fig. 9 . HF addition to C2H4. 



Hessian eigenvectors with non-zero eigenvalues. There is an infinite set 
of non-principal steepest-descent paths involving displacements along more 
than one principal direction. From equation (4.22) we see that 



Wc^ju) 

Wpiu) 



M^c(O) 

Wp{Q) 



exp {-u{ei 




(4.24) 



as M — > 00 for a minimum all the components of W tend to zero except 
for the one corresponding to the smallest curvature. Steepest-descent paths 
therefore approach a minimum parallel to the principal direction with the 
smallest curvature for which they have a non-zero projection [43,72], so long 
as this eigenvalue is non-degenerate. 

We have already noted in Section 4.4.2 that only two steepest-descent 
paths approach a transition state as u — 00 , while paths with Wi = 0 
approach the transition state when u 00 . The latter paths enter the 
transition state parallel to the principal direction of smallest curvature (e^) 
with which they have a non-zero overlap. 



4.4.4 Birth and death of symmetry elements 

Here we consider the symmetry group of a nuclear configuration X°, i.e. the 
point group of rotations and reflections, R. The action of R mapping a 
general 37V-dimensional vector X onto Y may be defined by 

= and X^ = C/ 3 ^Yp. (4.25) 

For symmetry operations of X° 

CX° = X°, (4.26) 

so the point group operations of configuration X° correspond to orthogonal 
matrices for which X° is an eigenvector with unit eigenvalue. Note that C 



D.J. Wales: Energy Landscapes 



463 



includes not only the effect of the point group operation considered as a 
mapping of nuclear coordinates^, but also a possible permutation of nuclear 
labels (see also Sect. 3.1) [43]. Using the chain rule gives 

dV 

or g(Y) = Cg(X). (4.27) 

3^ 02 y 

" H(V) = CH(X)C-. 

If C corresponds to a symmetry operation at X then X = Y and C g(X) = 
g(X). Hence the gradient vector transforms as the totally symmetric ir- 
reducible representation (IR) of the prevailing point group. For a small 
displacement A g(X) from X 

C(X-pAg(X)) = CX + ACg(X) 

= Y-pAg(Y). (4.28) 

Hence, if points X and Y are related by the operation C then so are the 
steepest-descent paths passing through X and Y. Furthermore, if C is 
a symmetry operation of the point X then the corresponding symmetry 
element is conserved along the steepest-descent pathway passing through 
X because C(X -I- Ag(X)) = X -I- Ag(X). However, additional symmetry 
elements may appear when a steepest-descent path reaches a stationary 
point [43]. 

The above proof shows that symmetry operations are conserved for in- 
finitesimal displacements along the gradient. Hence a stationary point must 
have every symmetry operation of the steepest-descent paths that converge 
to it. However, we cannot apply the theorem the other way round because 
the gradient vanishes at any stationary point. Hence, stationary points 
may possess additional symmetry elements that are not conserved along 
connecting steepest-descent paths. 

If we apply all the symmetry operations of a point X, with matrix repre- 
sentations Ri, to another point Y i then the number of symmetry equivalent 
points generated, m, must be an integer divisor of the order, hx, of the point 
group of X, Suppose the point Yi is invariant to s operations of 
comprising a subgroup, Qy ^ , with dimension s. Since we know there is at 
least one operation e Qy^ such that R^Yi = Y^, it follows that all s 
operations in the coset {Ri,^Yj} map Yi to Y^. There cannot be more 



dV 

d^V 

dY^dYp 



^In the conventional view, point group operations are considered to act upon vibronic 
coordinates. 
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than s operations because there is one operation that leaves Y i invariant for 
each operation that maps Yi to Y^. Hence there are precisely s operations 
mapping Yi to each of the {Yi, Y 2 , • . . , Ym}, so that sm = hx, and m 
must be an integer divisor of hx- 

We proved in Section 4.4.1 that a transition state possesses precisely 
two downhill steepest-descent paths that correspond to positive or negative 
displacements along the unique Hessian eigenvector associated with negative 
curvature. Since there are only two such paths we deduce that a point group 
operation of the transition state must either leave both paths unchanged or 
interconvert them. Symmetry operations of the transition state are therefore 
either conserved along both downhill steepest-descent paths, or exchange 
them. New symmetry operations may therefore appear at a transition state 
if they interconvert the two downhill steepest-descent paths. Let Pi and P 2 
be two points on the two steepest descent paths leading from a transition 
state, which are interconverted by a symmetry operation of the transition 
state whose matrix representation is R. Such operations must have even 
order, because any odd power of R applied to Pi must give P 2 , and hence 
cannot be the identity. Furthermore, R^”Pi = Pi for positive integer n, 
and hence all even powers of R correspond to symmetry elements that are 
conserved along both sides of the path. Similarly, all odd powers of R must 
interconvert the two sides. A new symmetry operation of the transition 
state must therefore also interconvert the stationary points ( “product” and 
“reactant”) at the two termini of the downhill steepest-descent paths. This 
situation can only arise if the pathway connects different permutational 
isomers of the same structure (a “degenerate” rearrangement [73]). 

In the degenerate rearrangement of an icosahedral Arss cluster via a 
cuboctahedron six five-fold axes are created at the two minima and three 
four-fold axes at the transition state. The conserved symmetry elements are 
those of the point group Th (order 24), whilst the minima belong to point 
group /h (order 120) and the transition state is Oh (order 48). 






Fig. 10. Rearrangement of an Arss cluster. 
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Clearly if a reaction pathway links different structures (i.e. not permuta- 
tional isomers) then no new symmetry elements can appear at the transition 
state. If the transition state does possess additional symmetry elements then 
it must have exactly twice as many symmetry operations as are conserved 
along the path, as illustrated above for Arss. If g symmetry elements are 
conserved along the steepest descent paths then the point group of the tran- 
sition state also contains precisely g such elements, which form a subgroup 
of C/ts, denoted If also contains an operation that interconverts the 
two paths, R, then the product of R with any element conserved along the 
path must also interconvert the two sides. These operations must all be 
distinct and form a coset of the transition state point group [74] denoted 
R^^^. Every element of belongs to or R^tti the two sets are 
interconverted by any member of Hence the order of is 2g. 

4.5 Classification of rearrangements 

We saw in Section 4.4.4 that the transition state for a degenerate rear- 
rangement, which links permutational isomers of the same structure [73], 
can possess a symmetry element that maps one side of the path onto the 
other. This symmetry element is only present at the transition state. 

Degenerate rearrangements are greatly outnumbered by non-degenerate 
pathways, but constitute an interesting minority. Transition state 2 in 
Figure 9 for the internal rotation of ffuoroethane has the same point group 
as the minima it connects. However, the mirror plane in the transition state 
is not conserved along the path, and interconverts the corresponding min- 
ima. Degenerate rearrangements where the two sides of the path are related 
by a symmetry operation of the transition state are known as symmetric 
degenerate rearrangements [75]. For HE dimer this symmetry is evident in 
the plot of energy versus path length shown in Figure 11. 

Asymmetric degenerate rearrangements are also possible, where the two 
minima have the same structure but the steepest-descent paths are not 
related by symmetry [75]. The energy profile is also asymmetric, and an 
example is shown for water trimer in Figure 12 [76]. 

The results of Section 4.4.4 may now be used to eliminate from consid- 
eration mechanisms whose pathways violate the symmetry results. For a 
non-degenerate rearrangement the stationary points at the termini of the 
steepest-descent paths descending from a transition state (the “products” 
(P) and “reactants” (R)) must possess all the symmetry elements of the 
transition state. Hence, (a) if the transition state is linear, the products 
and reactants are linear, (b) if the transition state is planar, the products 
and reactants are planar, (c) if the transition state is optically inactive, 
the products and reactants are optically inactive. Result (b) follows be- 
cause a necessary and sufficient condition for planarity is that E* S 
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path length/bohr 

Fig. 11. DZP/MP2 energy profile for the symmetric degenerate rearrangement 
of HF dimer. 




minimum 



Fig. 12. DZP/MP2 energy profile for the asymmetric degenerate rearrangement 
of (H20)3. 
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(see Sect. 3.1). Since the products and reactants may possess additional 
symmetry elements, an optically active transition state may link optically 
inactive products. However, the geometrical symmetries of the transition 
state for a non-degenerate rearrangement must also be present in the prod- 
ucts and reactants, and must therefore form a subgroup in each of the point 
groups and The size of the largest common subgroup of Qp and 
Qp therefore sets an upper bound on the size of the transition state point 
group [43]. 

Pechukas has derived some further symmetry constraints that apply 
when one or both of the products and reactants is bimolecular [43]. For 
an association or dissociation reaction where the bimolecular species are 
different the transition state can possess at most a single rotation axis and 
a mirror plane that both contain the two centres of mass. These are the 
only symmetry elements that can exchange nuclei within two different sep- 
arated molecules [43]. Additional transition state symmetries are possible 
for symmetric association or dissociation [43]. 

There have also been efforts to use symmetry in a predictive sense in 
analysing rearrangement mechanisms. A critique of earlier attempts is given 
by Bone et ai, along with a formulation in terms of the molecular symmetry 
group [77]. Given two permutational isomers of the same structure group 
theory can be used to determine the symmetry operations which could be 
generated at an intervening transition state [77-79]. These results have 
not yet been extensively exploited, perhaps partly because of the factorial 
growth in permutational isomers with system size, which makes calculations 
for all possible pairs of minima rather laborious. 

4.6 The Mclver-Stanton rules 

Mclver and Stanton proved a series of rules governing the properties of 
the “transition vector”, which are either equivalent to or follow from the 
results of the previous section [62,63] . Here the transition vector corresponds 
to the Hessian eigenvector corresponding to the unique negative Hessian 
eigenvalue. 

• Theorem I. The transition vector cannot transform according to a de- 
generate irreducible representation of the transition state point group; 

• Theorem II. The transition vector must be antisymmetric under a 
transition state symmetry operation that converts reactants into prod- 
ucts; 

• Theorem HI. The transition vector must be symmetric with respect 
to a symmetry operation that leaves either reactants or products un- 
changed; 
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• Theorem IV. If the transition vector for the reaction R\ — > P\ is 
symmetric under a symmetry operation O that converts reactants R\ 
into the equivalent reactants i ?2 and P\ into P 2 then there exist lower 
energy transition states for the pathways R\ — > R 2 and P\ P 2 
[case (a)]. If the transition vector is antisymmetric under O then 
there exists a lower energy transition state for the pathway R\ P 2 
[case(b)]. 

Theorem I is equivalent to the Murrell-Laidler theorem [59] : if the transition 
vector belongs to a degenerate irreducible representation of the transition 
state point group then there would have to be more than one negative 
Hessian eigenvalue. Theorem II is the case where a symmetry operation 
of the transition state maps one side of the path onto the other, while 
Theorem III applies to any symmetry operation that is conserved along the 
whole path. 

Theorem IV will apply if a transition state links either (a) two transi- 
tion states A and B for which OA=A and OB=B, or (b) the transition 
state mediates a symmetric degenerate rearrangement of permutational iso- 
mers Ai and A 2 where OAi=A 2 . In case (a) ORi=R 2 and OPi=P 2 , so 
transition states A and B mediate symmetric degenerate rearrangements 
of the products and reactants. Theorem IVa is realized for the FNH 2 F“ 
molecule in Figure 14. In case (b) the transition states Ai and A 2 mediate 
the non-degenerate rearrangements of Ri to Pi and R 2 to P 2 , respectively. 

A number of corollaries are possible. For example, no structure can be 
a transition state if it possesses a C 3 rotation or other odd order operation 
that interconverts reactants and products [62,63]. 



4.7 Coordinate transformations 



The steepest-descent paths defined above by equation (4.13) have a rigorous 
definition. However, the literature contains references to the “minimum en- 
ergy path” (MEP), “intrinsic reaction path” (IRP) and “intrinsic reaction 
coordinate” (IRC) . Often authors simply refer to the “reaction coordinate” 
or the “reaction path” and MEP, IRP and SDP are used interchangeably. It 
is also common for these acronyms to refer to paths calculated not by solu- 
tion of equation (4.13) but instead from mass- weighted coordinates defined 
by qa = Xa\/M^-. 



dq(s) ^ g(q(s)) 

ds |g(q(s))|’ 



(4.29) 



where Ma is the mass of the atom corresponding to nuclear coordinate a. 
The solutions are loosely referred to as steepest-descent paths in “mass- 
weighted coordinates” . 
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In fact properties of a potential energy surface such as steepest-descent 
paths, principal directions and normal mode frequencies should be coordi- 
nate independent [80,81]. If this were not the case then we would obtain 
different dynamic and thermodynamic properties from different sets of co- 
ordinates. However, if the metric tensor of the coordinate system does not 
correspond to the identity then extra care is needed to ensure that path- 
ways and frequencies remain coordinate independent. For example, when 
second derivative matrices (Hessians) are constructed using ordinary partial 
derivatives of the energy the eigenvalues can exhibit coordinate dependence. 
This result does not mean that one particular coordinate system is correct 
and that others are not. Rather, it means that one should not use ordi- 
nary partial derivatives to construct the Hessian when the metric does not 
correspond to the identity matrix [82]. 

Relations between physical properties that are independent of the ref- 
erence frame are obtained by taking proper account of metric tensors and 
using covariant derivatives [80,81]. In general curvilinear coordinates, Z^, 
the arc length, ds, is obtained from 

ds^= ^ Gc,i3dZ^dZf3 = Y,dXl (4.30) 

a, 0—1 ot 

and the length of a vector, L, with components, Va, is defined as 

L^= ^ (4.31) 

a, 0—1 

Gap and are the covariant and contravariant forms of the metric tensor 
for which^ GapG^'^ = Sa-y- If Gap = Sap we have a Euclidean space, 

otherwise the space is termed Riemannian. 

To find Gap for spherical polar coordinates: 

ds^ = dx^ + dy^ + dz^ = dr^ + r^d0^ + sin^ 6»d((i^, (4.32) 

and so Grr = 1, Gee = Gpp = r^sin^ 9 and all the other components 
are zero. In general 




^Superscripts are conventionally used for contravariant components, and subscripts 
for covariant components. The differential coordinates dZi, dZ2, . . . actually transform 
in a contravariant fashion, but subscripts are used here to maintain consistency with the 
rest of these notes. 
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The example of a free diatomic bound by a harmonic potential has been 
used in previous work to highlight the apparent dependence of normal mode 
frequencies on the coordinate system [83]. If the potential is = k{r — 
j2 with = (ri — r 2 )^ then the Hessian eigenvalues are 

0, 0, 0, 2fc(l-ro/r), 2A:(l-ro/r), 2k, (4.34) 



where unit masses have been assumed for the two atoms. The three zero 
eigenvalues correspond to overall translation, while the two rotations are 
coupled to the bond vibration by a term linear in the gradient, as expected 
[45]. Only for a stationary point with r = tq do the two rotational modes 
exhibit zero eigenvalues. 

The problem can be expressed in many other ways, for example in terms 
of 

ri and ri + r(sin0cos((),sin0 sin((), cos0), (4.35) 

or R± r(siii0cos((>,sin0 sin^, cos0)/2, R=(X,Y,Z), (4.36) 

where the corresponding coordinates are the components of ri and r, 9 and 
cj) in the first case and X, Y, Z, r, 9 and (j) in the second case. Both of these 
coordinate systems give the same eigenvalue spectrum as equation (4.34) 
when the Hessian is calculated from the covariant derivatives 



gy 

dqk 

k 

is a Christoffel symbol of the first kind, 

^ \ ' Akl 

ij- 2^^ 



dAji dAg _ dA^j 

dq^ dqi dq^ ’ 



(4.37) 



(4.38) 



Aij are components of the covariant metric tensor and A^^ are components 
of its inverse, the contravariant metric tensor. For example, with the coor- 
dinates {X, Y, Z,r,9, 4>} the metric tensor is diagonal with components 

1 j-2 1 

Axx = Ayy = Azz = 2, Arr = Ass = sin^ 9. 

(4.39) 

The generalised eigenvalue problem that accounts for the non-trivial metric 
tensor is [81] 

^(H,, - \A,,)d> = 0, (4.40) 

3 

and it is easily verified that solution yields the eigenvalues in (4.34) for both 
the coordinate systems defined above. 
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The use of covariant derivatives and the appropriate metric conserves 
all the Hessian eigenvalues for proper coordinate transformations, not only 
those associated with translation and rotation. The covariant first deriva- 
tive is just the usual partial derivative. Coordinate dependence has been 
exploited in a number of studies concerning geometry optimisation [84-87], 
where the use of simple partial derivatives can produce more efficient con- 
vergence. This approach is legitimate, since the stationary points that are 
located are stationary points in any proper coordinates. 



4.7.1 “Mass-weighted” steepest-descent paths 

Coordinate independent steepest-descent paths can be defined by [81]: 



3N 

/3=1 



dX/3(s) 

ds 



dV 

dX^ 






dX, dX, 



(4.41) 



For the mass- weighted coordinates, qa-, defined above, the metrics have 
non- vanishing components Gaa = 1/Mq, and G““ = M“. Steepest-descent 
paths should then be defined from equation (4.41) as 



1 dqgjs) 
Ma ds 




(4.42) 



Since d/dq^ = {\j ^ M^')d j dXp the above equation is equivalent to that for 

Steepest-descent paths calculated from (4.41) are invariant to proper 
coordinate transformations q = CX where the matrix C is non-singular. 
Paths calculated from equation (4.29) are different, and were popularised 
by Fukui and coworkers because they correspond to the solutions of the 
classical equations of motion in the limit of zero kinetic energy [60,88,89]. 

We have previously defined the normal coordinates, Qa, for which 



3N 

qa — ^ ^ ^CK 7 Q 7 and Q /3 

7=1 

3N 

where Ha/3 
/ 3=1 



3N 

^ ^ Aaf3 Qa 

a=l 

Aa'^ : 



(4.43) 
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3N 3N 



and 'y ^ H Qa ^13 — y ^ Qg- 




We can therefore repeat the analysis leading to equation (4.22) for the 



’mass-weighted” path to give 



Qa{u) = Qa{0) exp{-ujlu) 



(4.44) 



in the vicinity of a stationary point. Hence, the “mass- weighted” paths 
defined by (4.29) leave the transition state parallel or antiparallel to the 
normal mode corresponding to the unique imaginary frequency, Q\. 

Equation (4.43) shows that the displacements in terms of the mass- 
weighted coordinates qfj are parallel or antiparallel to the corresponding 
eigenvector of H. In contrast, true steepest-descent downhill pathways leave 
a transition state parallel or antiparallel to the unique eigenvector of H 
corresponding to the negative eigenvalue. 

All the rules for conservation, creation and destruction of symme- 
try elements along true steepest-descent paths carry over to the “mass- 
weighted” paths. In principle, a surface could be constructed where the 
“mass-weighted” and true steepest-descent paths connect different station- 
ary points. However, in practice the pathways usually do not look very 
different to the eye. For a homonuclear system they are, of course, the 
same. 

4.7.2 Sylvester’s law of inertia 

The inertia of a matrix, i, is defined by f+, i- and iqi the number of positive, 
negative and zero eigenvalues, respectively. Although the eigenvalues of H 
are clearly different from those of H the numbers of positive, negative and 
zero eigenvalues are preserved. Hence the “mass- weighting” formulation 
cannot change the index of a stationary point, as we now prove. 

In general, if two symmetric matrices A and B are related by 



where S is a non-singular matrix (i.e. exists) then A is said to be 
congruent to B. Since B = S~^A(S”^)^, B is also congruent to A. We 
will now prove that the inertia of A, «(A), is the same as the inertia of B, 
i(B) (Sylvester’s theorem) [90]. 

We are interested in the case where B = H, A = H and Saji = 
Sa/ 3 / \/Ma ■ First we show that A and B have the same number of zero 
eigenvalues. Let the fo(A) linearly independent eigenvectors of A with zero 



A = SBS^, 



(4.45) 
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eigenvalues be denoted ai , a. 2 , ( a) • Then we cannot find coefficients 
Cl, C 2 , . . . , Ciq(a), not all zero, satisfying 



io(A) 

^ ^ Cc^aQ — 0. 

a=l 



(4.46) 



Since Aao, = 0 we must have SB S^ac, = 0 and B S^ac, = 0 . Hence the 
io(A) vectors S^a°, a°, . . . , ^io(A) eigenvectors of B with zero 

eigenvalues. Furthermore, multiplying equation (4.46) on the left by we 
deduce that there are no coefficients, not all zero, satisfying 

*o(A) 

^c„S^a°=0. (4.47) 

Hence the io{A) vectors S^a°, S^a^,..., are linearly indepen- 

dent and io(B) > io(A). However, we may interchange the roles of A and 
B to obtain io(A) > zq(B), and so io(A) = io(B). 

In fact we constructed eigenvectors corresponding to zero eigenvalues 
explicitly for the matrices H and H in Section 4.1, and so we know that 
there must be equal numbers in this case. 

Now denote the i+(A) linearly independent eigenvectors of A with pos- 
itive eigenvalues by a)*" , aj , . . . , a)^ . Then we cannot find coefficients 

Cl, C 2 , . . . , Ci^(A)i not all zero, satisfying 

i+(A) 

(4.48) 

a— 1 

Let X = ^ Cq, a+ ^ 0. Then 

»+(A) 

x^Ax= ^2 (4.49) 

Of— 1 

Hence x^ S B S^x > 0, or B y > 0 where 

i+(A) 

y= CaS^a+. (4.50) 

a=l 

Hence there are at least *-i-(A) linearly independent eigenvectors of B with 
positive eigenvalues, and so *-i-(B) > i+(A). As above we may interchange 
the roles of A and B to give *-i-(A) > i+(B) and finally *-i-(A) = i+(B). 
This completes the proof, since i- = n — i+ — iq for an n x n matrix. 
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4.8 Branch points 

The curvature along a steepest-descent path changes from negative to posi- 
tive as we pass from the transition state to a minimum. Hence it must pass 
through zero somewhere along the path. The curvature associated with 
principal directions perpendicular to the steepest-descent path is normally 
positive, but it can change from positive to negative if the path leads to 
another transition state. The point at which the curvature along an or- 
thogonal principal direction passes through zero is called a branch point or 
valley-ridge inflection point (Fig. 13) (see also Sect. 4.4.2). 




Fig. 13. Schematic view of a branch point. 

Such features challenge our intuitive notion of a “minimum energy 
path” (MEP). The steepest-descent path, as defined above, continues along 
the ridge because the gradient has no component to send the path down 
either side of the branch point. Intuitively, we might feel that the MEP 
should depart from the steepest-descent path at such points. However, it is 
not possible to define such a path in terms of the solution of a differential 
equation, as we could for the steepest-descent path. Hence, if we define the 
“minimum energy path” as a steepest-descent path then it must go over 
the ridge [91]. To produce a definition more in keeping with intuition we 
would have to permit the path to bifurcate. If the system is infinitesimally 
perturbed from the true steepest-descent path then it will indeed follow 
such a trajectory. Unfortunately there does not seem to be a rigorous defini- 
tion of the reaction path that allows this to happen. Hence, although branch 
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points are sometimes referred to as “bifurcation” points, a bifurcation in 
the steepest-descent path cannot occur unless the branch point is also a 
stationary point [91]. Such a situation could only occur by accident in a 
realistic molecular system. 

If a bifurcation point is defined as a point with two downhill steepest de- 
scent paths, then such a point must be a stationary point [58,91]. However, 
branch points are often referred to as bifurcation points in the literature. 



transition state 2 




transition state I 



branch point 



branch point 



Q" transition state 2 



Fig. 14. Schematic potential energy surface for FNH 2 F . 

A branch point must occur whenever a steepest-descent path connects 
two transition states: the curvature along the initial reaction coordinate 
must become positive, whilst the curvature correlating with the reaction 
coordinate of the new transition state must change from positive to nega- 
tive. There is a branch point between the two transition states involved in 
the addition of HE to ethene (Fig. 9) [71]. Another example is provided 
by rearrangements of FNH 2 F“ (Fig. 14) [92]. For this system two equivalent 
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Fig. 15. Evolution of the lowest eigenvalues for FNH2F along the path connect- 
ing the C2v transition state (left) to a Cs transition state (right). 



branch points exist between the C 2 V transition state in the middle and 
the two transition states with Cs symmetry. Cs symmetry is preserved 
along the paths connecting these stationary points. The evolution of the 
lowest Hessian eigenvalues is shown for the C 2 V — *■ Cs path in Figure 15. 
The corresponding eigenvectors are S(ymmetric) and A(ntisymmetric) with 
respect to the conserved mirror plane. The motion perpendicular to the 
steepest-descent path at a branch point is usually associated with symmetry- 
breaking. For the above path the perpendicular direction whose curvature 
passes through zero at the branch point corresponds to loss of the conserved 
mirror plane. Notice also that the C 2 V transition state has an additional 
symmetry element, namely a C 2 rotation axis. 

Since the gradient vector transforms as the totally symmetric irreducible 
representation Fq (Sect. 4.4.4), a branch point must occur whenever the 
eigenvalue associated with a Hessian eigenvector that does not transform 
according to Fq passes through zero. 

Ramquet et al. have recently argued that the location of valley-ridge in- 
flection (or branch) points on a steepest descent path depends upon the coor- 
dinate system [58] . According to their definition, such points occur when the 
curvature vanishes in a direction, d, perpendicular to the steepest-descent 
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path. Coordinate independent curvatures can be defined using covariant 
derivatives as [82,93] 



12 ij ^ij 



(4.51) 



Branch points defined according to this definition are also coordinate inde- 
pendent. 

Ramquet et al. also note that the vanishing of a curvature perpendicu- 
lar to a steepest-descent path need not correspond to a vanishing Hessian 
eigenvalue, and conclude that branch points need not possess a zero Hessian 
eigenvalue [58]. If we expand the direction, d, in terms of the Hessian eigen- 
vectors, by where Hoi = ef then 



3N 

= (4.52) 

where Ci = and d is assumed to be normalised. The condition 

12i=i =0 results when = 0 for a particular a, with Cq, = 1 and 

d^a = 0. If one (or more) of the ef are negative there can be solutions 
where d is not simply parallel to a Hessian eigenvector with a zero eigen- 
value. An example occurs in Figure 16 for steepest-descent paths with 
broken symmetry lying close to the path that links TSl and TS2. 

Isotopic substitution has some interesting effects for surfaces with branch 
points [94] . Since the Born-Oppenheimer PES is independent of the atomic 
masses, the permutation of isotopes of the same element is also a symme- 
try operation. Hence, the coordinate independent steepest-descent paths 
defined by (4.41) are unchanged. However, the path corresponding to the 
intrinsic reaction coordinate (IRC), defined from (4.29), does not possess 
this additional symmetry, and deviates from the steepest-descent path ob- 
tained from (4.41) [94]. A branch point may therefore occur on the IRC in 
Cl symmetry. A similar effect can be seen for the model surface shown in 
Figure 16. There is a branch point on the unique steepest-descent path link- 
ing TSl and TS2, as expected, while nearby paths corresponding to broken 
symmetry also exhibit a perpendicular direction whose curvature changes 
from positive to negative. 

5 Tunnelling 

Tunnelling is the quantum mechanical situation where systems have a finite 
probability of being found in classically forbidden regions. The simplest 
example is a particle with total energy Ei tunnelling through a barrier 
of height E 2 > El. The wavefunction in the barrier region is a decaying 
exponential and the transmission probability is largest when the energy 
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Fig. 16. Steepest descent paths and Hessian eigenvalues for a model potential en- 
ergy surface. The top panel shows some paths superimposed on the surface itself, 
while the lower two panels show the curvatures parallel and perpendicular, C\\ 
and C_L , to the path linking the two transition states (middle) and a path slightly 
displaced from it (bottom). The latter path avoids TS2, and finally converges 
to one of the two equivalent minima. It exhibits a branch point near to the one 
on the symmetry conserved path, and two ridge-valley inflection points: the first 
near TSl where the path turns through 90°, and the other in the region between 
TS2 and the minimum. 



difference and particle mass are small and the barrier is narrow. Hence 
tunnelling is only likely to give rise to observable effects for light atoms and 
short pathways with small barriers. 

The classic example of tunnelling manifesting itself in spectroscopy is 
the inversion doubling of the ammonia molecule [95,96]. The umbrella 
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motion takes the system through the planar D^h geometry from one 
minimum to a permutational isomer (Fig. 17). If the two potential wells were 
isolated they would support identical rovibronic energy levels. However, the 
barrier is sufficiently small for the localised wavefunctions in the two wells 
to interfere. The correct wavefunctions are therefore the sum and difference 
of the separate wavefunctions for the two wells and are delocalised. 

Compare the interaction of atomic orbitals to give molecular orbitals. 
If two hydrogen atoms are far apart then we can treat each one as iso- 
lated but as they approach the Is orbitals interact. The molecular orbitals 
are proportional to the sum and difference of the Is functions, and the 
energy levels become a ± /3 in the Hiickel approximation, where a is the 
isolated energy level and f3 is the off-diagonal Hamiltonian matrix element 
between the two Is orbitals. We can treat the double- well problem for 
ammonia in just the same way, forming a delocalised linear combination 
of localised wavefunctions, analogous to a linear combination of atomic or- 
bitals. If we denote the localised wavefunctions by iIji £^nd ■02 and the 
matrix elements of the Hamiltonian by a = {'tjji\'H\tjji) = and 

(3 = (' 0 i|?i|^/’ 2 ) = then the resulting secular equations are entirely 

analogous to the hydrogen molecule problem. The Hiickel- type approxima- 
tion may be justified for tunnelling problems because of the exponential 
decay of the wavefunctions in the barrier regions, although sometimes bet- 
ter approximations are needed [97]. 

For NH 3 the localised levels are split into pairs - hence the term inversion 
doubling. The splitting of the lowest level is 0.793 cm“^, and the splitting 
increases for higher energy states because the barrier is narrower and lower. 

Tunnelling between minima with different structures (non-degenerate 
tunnelling) is generally assumed to lead to much smaller effects than 
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transition state 
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Fig. 18. Degenerate rearrangement of the HF dimer. 



tunnelling between equivalent minima. The reason is basically that only 
the energy levels of permutational isomers of the same structure are in res- 
onance. 



5. 1 Tunnelling in ( HF )2 

The symmetric degenerate rearrangement of the (HF )2 Cg minima via a 
C 2 h transition state is shown in Figure 18 [98]. Note the appearance and 
disappearance of the C 2 symmetry element at the transition state. The 
secular determinant for each pair of degenerate wavefunctions is 



a-E 13 
(3 a-E 



= 0 , 



(5.1) 



with solutions E± = a ± /3. The splitting is therefore 2/3. The ground 
vibrational state of the dimer is subject to a tunnelling splitting of 0.66 cm“^ 
[98]. However, this splitting decreases to 0.22 and 0.23 cm“^ for the excited 
vibrational states corresponding to one quantum of vibration in the non-H- 
bonded and H-bonded HF monomers, respectively. The dependence of the 
splitting of low-frequency “floppy” modes upon excitation of high-frequency 
modes can often only be explained by detailed calculation [99-106]. 



5.2 Tunnelling in (H20)3 

New experiments in the far-infrared region have recently become possible 
that can probe the low frequency modes of a van der Waals complex and re- 
solve rotational transitions [107-110]. Results for the water trimer, (H20)3, 
have generated considerable theoretical activity in the last few years, and 
the spectra have now been assigned in some detail [111-117]. The lowest 
energy rearrangement mechanism of the cyclic global minimum is known as 
the “flip” (Fig. 19) [76,118]. The global minimum is a “frustrated” struc- 
ture because two “dangling” hydrogen atoms must be next to each other. 
The flip is an asymmetric degenerate rearrangement [75]. Each minimum 
can rearrange via two distinct flips, corresponding to either of the adjacent 
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transition state 
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Fig. 19. Single flip rearrangement of (H20)3. 



“dangling” hydrogens. This process is facile, as no hydrogen bonds are made 
or broken, and is localised at a single hydrogen atom. The mechanism leads 
to a low effective mass, and, in combination with the low barrier, produces 
large tunnelling splittings on the order of 10 cm“^. 

It is not hard to show that three successive flips connect any permuta- 
tional isomer to its enantiomer, while six flips can take us through a closed 
set of six such isomers [76]. The secular problem for the energy levels is 
analogous to the treatment of the tt system of benzene in the Hiickel ap- 
proximation, with tunnelling levels at a ± 2/3 (non-degenerate) and a ± /3 
(doubly degenerate), in good agreement with experiment and more accurate 
dynamical calculations [119,120]. 

6 Global thermodynamics 

6.1 The superposition approximation 



is the vibrational Hamiltonian in the harmonic approximation for a station- 
ary point at the origin with normal coordinates Qa- The total microcanoni- 
cal (constant N, V and E) energy density of states for one minimum, H(i3), 
may be obtained from the separate kinetic (momentum) and configurational 
densities of states, Hk(£’k) and Hc(i3c), respectively. H(i3) is the differen- 
tial of G{E), the total phase volume with energy less than E, with respect 
to the total energy. Hence 




3Af-6 



( 6 . 1 ) 




( 6 . 2 ) 
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and the total phase volume is the convolution of either the momentum phase 
volume with the configurational density of states, or vice versa: 

n,{E,)G^{E - E,)dE, = [ Gc{E - EK)nK{EK)dEK. 

Jo 

To find Gc(Ac), X^a‘^aQa/2 is transformed into a sum of squares using 
the substitution Ca = ‘^aQa/2- The standard formula for the volume of a 
hypersphere radius Rin k = 3N — 6 dimensions is (T{k(2 + 1) [121], 

and the Jacobian factor of 1/ JlfLi then gives 




Gc{E,) = 



(Ac - 



(27r)-/2r(«/2+i)nLi^«n;=i^. 



jN ’ijl'’ 



(6.3) 



where Wa = ^r^Va-, T is the Gamma function and the mass of atom t is mt- 
Hence 

(Ac - A°)'"/2 -i 



flc(Ac) = 



(27r)-/2r(^/2)nlim'^' 



(6.4) 



Similarly, 



Gk(Ak) = 



(27t) ' E^ 

T(k/2 + 1) 



and Hk(Ak) = ^ ■ (6.5) 

T(k/2) 



Note that the Jacobian for the overall transformation of position and mo- 
mentum variables to normal coordinates is unity. The total microcanonical 
phase volume may therefore be obtained as 



G(A) 




A°)GK(A-Ac)dAc, 



r(K/2)r(K/2 + l)n>, Jeo 

{E-E°Y 

r(K-Hi)rij=i^'i’ 



( 6 . 6 ) 



and so fl(A) = (6.7) 

The above formula gives the harmonic approximation for the total energy 
density of vibrational states associated with a single minimum of a system 
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with fixed centre of mass and fixed orientation with respect to space fixed 
axes. A factor of 1/h'^ gives the semiclassical approximation. Both G{E) 
and ^1{E) vanish unless E > and this behaviour may be guaranteed by 
including a step function Q{E — E^) in (6.6) and (6.7). 

In fact, equation (6.7) may be derived more quickly using the canonical 
partition function for a harmonic oscillator for each vibrational mode and 
taking an inverse Laplace transform. However, some of the intermediate 
results, such as Hk(A'k) will be employed in later derivations. 

To calculate the vibrational density of states for the whole system, all 
the minima need to be considered. In the superposition approximation we 
simply sum the density of states over all the minima low enough in energy 
to contribute [122]: 



n{E)= 

E°<E 



tIs{E-E°Y-^ 

r(«)n;=iK ’ 



( 6 . 8 ) 



where the sum is over all the structurally distinct minima on the surface 
and ris, the number of permutational isomers of minimum s, is given by 
Us = 2N\/hs where hg is the order of the point group of s (Sect. 3.1). The 
superposition approach was originally considered in the early days of cluster 
simulation [123-125], but has only recently become practical for larger clus- 
ters [122,126]. The first attempts to implement this decomposition of phase 
space over minima [127] were not very successful because of the difficulty 
in obtaining sufficiently large samples of minima, and because the effect 
of point group symmetry was omitted [125]. The latter mistake roughly 
doubles [1] the predicted melting point for LJ13, the thirteen-atom cluster 
bound by the Lennard-Jones potential [128]. 

The canonical (constant N , V and T) partition function is obtained 
from Laplace transformation of the energy density of states, and hence the 
superposition approximation gives 

0(T) = (6-9) 



where (3 = 1/kT, k is the Boltzmann constant, and Vg is the geometric mean 
normal mode frequency of minimum s. 

Thermodynamic properties can now be derived straightforwardly from 
fl(E) and Q(T). For example, the thermodynamic definition of temperature 
as T = l/{dS/dE) in the microcanonical ensemble gives: 



1 

T 



k V T 

n(E) ^ 

^ ’ E°<E 



ng{E-E°r-^ 

r(«-i)n;=iK’ 



(6.10) 
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while the average values of the internal and kinetic energy in the canonical 
ensemble are: 



{E) 

{E-k) 






KkT 



( 6 . 11 ) 



The temperature and heat capacity in the microcanonical ensemble can also 
be expressed in terms of expectation values of the kinetic energy, using the 
probability distribution for Ek derived in (7.5). 

An alternative microcanonical kinetic temperature, Tk, may be de- 
fined in terms of the mean kinetic energy as Tk = 2 (Ek) /kn. Since 
G'k(71^k) = ‘^EyS^]^{E]^) / K (from (6.5)) it can also be shown that (Ak) = 
KG{E)/2fl{E), and hence that [129,130] 



T{E) = Tk{E)/ 



1-k 




( 6 . 12 ) 



so that the thermodynamic and kinetic definitions of the microcanonical 
temperature differ by a leading term of order N~^ , as expected [131-133]. 
Umirzakov has shown that a van der Waals loop in Tk{E) necessitates a 
loop in T{E), while the converse relation need not be true [130]. In fact, 
Tk corresponds to the temperature determined by the energy derivative of 
the entropy when it is defined as S = fclnG(if), because 

f dklnG{E) \ kn{E) kn 1 

V dE )^y G{E) 2(Ak) Tk’ ^ ^ 

The alternative definitions of entropy and temperature are identical in the 
thermodynamic limit [134]. 

Properties calculated from the superposition approximation are not 
equivalent to a weighted sum over minima. For example, 

Psln^s{E) = In fl{E) + Ts In 

E°<E e°<e 

= lnfl{E)+ PsinPs. (6.14) 

e°<e 



^s{E) 

n{E) 



Hence the entropy calculated as a weighted sum differs from the true entropy 
by —kJ2E«<E Es InPs > 0, which is the entropy associated with the choice 
of minima in which the system may find itself. 
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6.2 Sample incompleteness 

The difficulty with equation (6.8) is that for all but the very smallest clus- 
ters the sum involves an impractically large number of minima. For the 
LJ55 cluster there may be as many as 10^^ terms. In such a case, as it is not 
possible to obtain a complete set of minima, a statistically representative 
sample is needed. A large set of minima can be obtained by systematic 
quenching from a high energy molecular dynamics (MD) trajectory. How- 
ever, this gives a greater proportion of the low energy minima than of the 
high energy minima. Consequently, if the sample is used in equation (6.8) 
it is likely to underestimate the density of states due to the high energy 
minima, and so be inaccurate at high energies. 

A method is needed that corrects for the incomplete nature of the sample 
of minima. This correction can be achieved by weighting the density of 
states for each known minimum by ps, the number of minima of energy 
for which the minimum s is representative [122,132]. Hence, 



n(E)= 

E°<E 



PsTisjE - 

t(k) rii=i hvj 



(6.15) 



where the sum is now over a representative sample of minima. The effect of 
Qs can be incorporated using the quench statistics. If the system is ergodic 
and the MD run is performed at constant energy E' , the number of quenches 
to a minimum, 7, is assumed to be proportional to the density of states of 
the set of ps minima, i.e. ‘j{E')s oc psSt{E')s- Hence, 

E2<E ^ 

/ JT' ZT’O \ 

“ ITT# ^ <«■“) 



This reweighting technique is analogous to the histogram Monte-Carlo ap- 
proach [135-138], but instead of determining the configurational density 
of states from the canonical potential energy distribution, p, effectively a 
density of minima, is found from the microcanonical probability distribu- 
tion for quenching to a minimum. If all the low energy minima are known 
equation (6.8) should be reasonably accurate at low energies. The propor- 
tionality constant in equation (6.16) can be found by matching it to the low 
energy form of equation (6.8) [132]. 

At low enough energy the term due to the global minimum dominates 
and the proportionality constant in (6.16), c, may be found by comparison 
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with (6.8): 



no{E' - 

l{E%nn)Y{Uh4 



(6.17) 



The harmonic superposition method has three main possible sources of er- 
ror [122, 132]. The first is associated with the statistical accuracy of the 
quench frequencies. These errors can be eliminated in principle by having a 
long enough quench run to ensure ergodicity, and by choosing an appropri- 
ate energy for the MD run so that the relevant regions of phase space are 
all significantly sampled. When studying the thermodynamics of melting 
it is most appropriate to choose E' to lie in the coexistence region, so that 
quenches to solid- like, liquid-like and surface-melted states are frequent. Al- 
ternatively, a simulation that achieves better ergodicity could be used, such 
as simulated tempering [139]. The second possible source of error is the as- 
sumption that the phase space volumes for each minimum can be summed 
independently, i.e. the hyperellipsoids in phase space do not overlap. If 
overlap occurred ^{E) would be overestimated. The third possible source 
of error is the harmonic approximation. Near the bottom of the well the 
harmonic assumption is reasonable, but as the energy is increased some 
parts of the well become increasingly fiat. Consequently the harmonic ap- 
proximation causes Q.{E) to be underestimated. Unfortunately it is also the 
most difficult to correct. 



6.3 Thermodynamics and cluster simulation 

There are some further subtleties involved in the interpretation of either 
molecular dynamics or Monte-Carlo simulations of clusters. If such simula- 
tions are run for long enough, then eventually atoms will evaporate at most 
energies or temperatures of interest. In practice some sort of constraint is 
applied, usually in the form of a container. Alternatively, one can check for 
evaporation periodically and reject or reverse such moves [140,141]. Hence 
the densities of states and partition functions that we consider actually 
correspond to bound clusters. Fortunately, the effect of the container is 
relatively small for a reasonable range of container sizes [48]. However, 
such constraints influence the corresponding density of states and partition 
function, and hence thermodynamic and dynamic properties [142--145]. 

Constraints due to constant total linear and angular momenta, P and L, 
may be treated formally using the accessible total energy density of states: 



n{E,p,L) 



d[H(X,{ps})-E]S 



P - 



N 

^Ps 



dXidX2 . . . dX^Ndpidp2 ■ . . dpsN, 



N 

L ^ ^ X-g A Ps 



(6.18) 
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where Ps is the three-dimensional momentum vector for atom s. We can 
integrate over the momenta analytically [144]. A new derivation will be 
given here, denoting Xsx the x coordinate of atom s, etc., and defining the 
coordinates P* and P'" with components: 



N 



pt 


= '^PsxY^s/^ 
s^l 




pt 


N 

= '^PsyV^/'^ 

S^l 




pt 


N 

= '^PszV^s/Vm 
s^l 




pr 


N 

~ ^ ^X^syPsz ~ ^szPsy 

s^l 


^VtoT/v^ 


pr 


N 

~ ^ ^^{^szPsx ~ ^sxPsz 




pr 

^X 


N 

~ ^ ~ ^syPsa 





s=l 



(6.19) 



where = Ps/ydriJ, M is the total mass, /°, ly and are the principal 
moments of inertia tensor with respect to the centre of mass, R, we define 
Xi = U^(Xi - R), and 






( 6 . 20 ) 



/3t 



N 



with [(X«-R)^(X«-R),5;3^-(A,;3-R^)(A,^-R 



■'IJ} • 



S=1 
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The coordinates in (6.19) define the first six rows of an orthogonal trans- 
formation matrix, V, in the 37V-dimensional momentum space: 



/ 





V 







0 


Xlzy/ml 


X\y^JUl\ 




0 


X\x\f^^^ 


Xly^/^ 




0 


0 


X2zy/^ 


— X2yy/m2 


-X2zy/m^ 


0 


X2x^/'ni2 


X2y^/m^ 


—X2xy/^ 


0 


0 


XNzy/mN 


—X]s[yy/mN 


■X^zy^rUN 


0 


XNxy/lTT-N 


XNyy/mN 


— X Nxy/lTT-N 


0 



\ 



/ 



It is not difficult to show that these first six columns are orthonormal; for 
example, 



3N 



N 



i^l 

3N 

J2V,2V,4 



'^ms{Xgy + Xg^)/I° — 1 
S^l 
N 

-Y,rrisXs,/VM = 0 



3N 



s=l 

N 






S=1 



where the second and third lines follow from 

N 

V^'^rUsiX,- R) = U^(MR-MR) = 0, 



N 



and Ulp 1% U^5 = Sc5l° = 5 



^7 



s=l 



(6.21) 



( 6 . 22 ) 






The Jacobian for the transformation from the {Ps} to the {p^} is 
. The kinetic energy is diagonal in terms of {p^}, and this form 
is preserved by the orthogonal transformation specified by V, for which the 
Jacobian is unity. The delta function for the centre of mass linear momen- 
tum may be rearranged as 



N 






dP^dP^dP^ = / S 



p - p*Cm 



P/\/m - P‘j /M^/^dP^dP^dP; 



t 



dP^dP^dPl 



(6.23) 



D.J. Wales: Energy Landscapes 



489 



because / (5(aa;)da; = / i5(a;)/|a|da;. Hence, integration over P* fixes the 
centre of mass kinetic energy in H to P'^I2M and produces a factor of 
Defining L° = R /\ ^ ^ Ps the dcltci function for the cin^ulcir 
momentum becomes: 



N 



L ^ ^ A Ps 



= 5 






N 



(L-L°)-^(X«-R)Ap« 



(6.24) 



S=1 



= (5 



N 



(L - L°) - DU^(X, - R) A p. 



(6.25) 



S=1 



= ^5[DU^(L-L°)-P'-], (6.26) 

where |D| is the determinant of D and = U^(C)“^U, so that Dap = 
Sap / and DU^(Xs — R) Aps = P’'. Integration over P"" therefore 

introduces a factor of 1 /a/AAA = 1/|C and fixes the contribution to 
the kinetic energy from angular momentum about the centre of mass to 



(P'-)2/2 = (L - L°)^UD'^DU^(L - L°) /2 

= (L-L°)^(I°)-/L-L°)/2. (6.27) 



It remains to integrate over the remaining 3iV — 6 linear combinations of the 
p^a^ which we denote by /'y, Csi • • • i Csn- These linear combinations need not 
be specified so long as the overall transformation is orthogonal, and leaves 
the kinetic energy diagonal. Hence (6.18) reduces to 



n N ?> j'l 



X S 



V{X) + J2 



M3/2 

cf p2 (L-L0)^(I0)-1(L-L°) 



i=7 



2 2M 



-E 



dXidX2 . . . dXaArdCTdCs . . . dCsAr. 



|l0|l/2 

(6.28) 

(6.29) 



Defining 






Ci = Ci/'sA, and using the result [146] f S[f{x)]dx = f 5[x — Xo\/\f {xq)\<1x, 
where /(sq) = Oj we may write 




C-E 



i/2 



/2E, 



(6.31) 
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where C = The standard formula for the surface area of a 

hypersphere radius R in k dimensions is + 1), and hence 

we finally obtain [144] 




(6.32) 



where 0 is the Heaviside step function, restricting the range of the integrand 
to if > 0. 

To describe the motion of a non-rigid molecule containing N atoms 
it is usual to use three centre of mass coordinates, three Euler angles 
and the 3N — 6 vibrational normal coordinates [44,45,57]. The orienta- 
tion of the molecule relative to a stationary laboratory frame may be fixed 
using the Eckart conditions [44,45,57]: 



where is the x component of atom s in the equilibrium reference geom- 
etry, etc. The above conditions provide an unambiguous way to specify the 
orientation of the “molecule fixed” (or “nuclear fixed” [44]) axes from the 
nuclear coordinates. From (4.9) we see that they are equivalent to setting 
the normal coordinates corresponding to infinitesimal rotations to zero. For 
the equilibrium nuclear configuration the Eckart conditions (6.33) are satis- 
fied by the principal axes in which the moment of inertia tensor is diagonal. 
Otherwise (6.33) must be solved numerically to obtain the necessary Euler 
angles, and hence the orientation of the rotating molecule fixed axis system 
in which the vibrational angular momentum is minimised [44,45,57]. 

For simplicity, we now set the centre of mass to the origin, so that Lq = 
0, and consider a non-translating, non-rotating cluster with P = L = 0. 
If we consider a normal mode expansion for the position coordinates about 
a local minimum, and assume that where 1° is the fixed 

equilibrium moment of inertia tensor, then (6.32) may be integrated over 
the vibrational coordinates as in Section 6.1 to obtain 



N 




s=l 

N 






S=1 

N 




(6.33) 



f^(T^,0,0) ^«p(^)^3/2|I0|l/2 




j dQ‘’'“MQ™*, (6.34) 
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where uJ is the geometric mean of the non-zero {wq} from (6.1). To fix the 
centre of mass and orientation of the cluster we can transform the remaining 
six normal coordinates to the centre of mass coordinates, R, and three Euler 
angles. This transformation requires a Jacobian factor, and then integration 
over delta functions in the centre of mass and Euler angle variables simply 
gives unity. Since = VmR the Jacobian |dQ*"“VdR| = M^/^. 

Expressions for infinitesimal rotations about the principal axes may be ob- 
tained in terms of the change in the corresponding Euler angle and in terms 
of one of the dQ™* from (4.9). Hence we obtain a further Jacobian factor of 
The final result for a non-rotating cluster with fixed centre of mass 
in the normal mode approximation is: 

= (6-35) 



in agreement with (6.7). 

For a rotating cluster with P = 0 (6.32) the kinetic and configurational 
densities of states may be factored [144] as 



L!(E,0,L)= [ riK{EKWE - EK^dEK, 
Jo 



(6.36) 



where riK(£'K) is given by (6.5) and 

, , f S\Ec-V(X.) 

f^c(i?c,L) = J ^ Mol2\io\i/2 ^dXidX2 . . .dX3^. (6.37) 



Hence, even for a non-rotating cluster with L = 0 there is a geometrical 
factor in the configurational density of states due to the term 1 /|I°|^/^ 
[142-144]. 



6.4 Example: Isomerisation dynamics of LJj 

Miller and Wales used frequent quenching from microcanonical MD sim- 
ulations to calculate the distribution of lifetimes for the four geometrical 
isomers of the LJ 7 cluster [147]. The contribution of each isomer to the 
density of states was also obtained using Metropolis importance sampling 
of configuration space in the microcanonical ensemble. The occupation 
probabilities obtained from these contributions were found to be in good 
agreement with the MD quench statistics, in contrast to the results of Seko 
and Takatsuka, who concluded that the dynamics are non-ergodic in the 
melting region [148]. 

Although the MD and MC results of Figure 20 using (6.38) are very 
close, there is a perceptible systematic drift at higher energies because the 
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-15 -14 -13 -12 -11 




-15 -14 -13 -12 -11 

total energy /e 

Fig. 20. Occupation fractions for each of the four minima of LJy as a function of 
energy: top and bottom, results using (6.38) [147] and (6.39) [149], respectively. 



moment of inertia factor was not included in the MC acceptance condition. 
Miller [149] has shown that the occupation probabilities from MC and MD 
simulations are in even better agreement when the MC acceptance probabil- 
ity is changed from the usual Metropolis condition [150] for microcanonical 
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MC [151] 

7t(X — !■ X^) = min 



/ [E-E(X')]”/^"^0[E-E(X')]\ 

V’ [E-y(X)]«/2-i0[E-E(X)] j 



to [144] 

[E-E(X')]'^/2-1v1^0[E-E(X')]\ 

7t(X — > X ) = mm 1, ^ == • 

V [E-v{x)]-/^-^y^e[E-v{x)] ) 



(6.38) 



(6.39) 



The systematic drift observed with increasing energy is eliminated when 
the correct configuration space weighting is used (Fig. 20). The correction 
increases with energy because fiuctuations in the moment of inertia increase. 
For larger clusters (with L = 0) this effect would probably be even smaller, 
and so the uncorrected acceptance probability should give good results. 



7 Finite size phase transitions 

A first order phase transition in a bulk system occurs when the appropri- 
ate thermodynamic potential {e.g. free energy or entropy) exhibits a double 
minimum (or maximum) over some range of parameter space, with a bar- 
rier (or well) between the two extrema. For a solid-liquid transition the 
control parameter may be either temperature or pressure. If we consider 
conditions of constant N, P and T then the transition occurs where the 
chemical potentials of the solid and liquid phases are equal. Above and be- 
low the transition temperature one of the free energy minima corresponds 
to a metastable phase, and hysteresis (superheating or supercooling) may 
be observed in heating and cooling curves because of the free energy barrier 
between them [152]. For fixed N and P the two phases will coexist at the 
transition temperature, and in the P — T plane there is a coexistence line 
in the phase diagram. The free energy derivative, and hence the entropy 
and other thermodynamic and structural properties, are discontinuous at 
the transition. The heat capacity is therefore singular at the transition 
temperature, since it is a second derivative property. 

The melting transition of a finite system is different from bulk in several 
ways. For finite systems results obtained from different ensembles need 
only agree to order 0{1/N) [131]. This ensemble dependence resulted in 
controversy and confusion in some earlier work [153], but was recognised in 
the context of astrophysics by Lynden-Bell and Lynden-Bell [154] and by 
Honeycutt and Anderson [155] in the context of clusters. Bixon and Jortner 
compared the caloric curves generated by simple model partition functions 
in the microcanonical and canonical ensembles, and noted that a negative 
slope is impossible in the latter ensemble because it is proportional to the 
mean squared energy fluctuation [156]. 
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The singularities that characterise a bulk transition are rounded in fi- 
nite systems, and the width of the transition is predicted [157,158] to scale 
as 1/iV. Furthermore, the van Hove theorem [159], which forbids the ap- 
pearance of van der Waals loops, or S'-bends, in bulk material, does not 
apply. Accurate calculations of the total energy density of states using the 
histogram Monte Carlo approach [160] finally confirmed that such features 
are not the consequence of mean-field approximations [48,161,162]. In finite 
systems the energetic or entropic penalty for the formation of an interface 
can prevent solidlike and liquidlike forms from coexisting in contact, and 
instead it is common to observe a range of energy (or temperature) for 
which the system samples two separate forms [153]. This situation must be 
distinguished from the loops observed in approximate theories of the solid- 
liquid phase transition, which result from the neglect of the phase space 
corresponding to phase separation [163,164]. 

7. 1 Stability and van der Waals loops 

Stability conditions are defined in terms of derivatives of the appropriate 
thermodynamic potential [165]. For the microcanonical ensemble the en- 
tropy, S, is maximised when dT /dE > 0. Some finite systems exhibit a loop 
in {T{E )) the microcanonical caloric curve. Some authors reserve the term 
“van der Waals loop” for loops that result from approximate theories, but 
here we will use “van der Waals loop” and “5-bend” interchangeably. The 
branches with positive slope are interpreted as stable solid-like and liquid- 
like forms (Fig. 21). The canonical caloric curve, T{{E)^^^) cannot have 
a negative slope because dE/dT = {{E — (E) / kT'^ . Such phe- 
nomenology was certainly well understood by Hill in the context of ligand 
adsorption on proteins well before the first simulations of clusters [166-168]. 

More recently, order parameters have been used to provide a natural 
connection to bulk phase transitions and an operational criterion for phase 
coexistence [141,164,169]. Here we define a Landau free energy and entropy 
as 



Al(Q) = A{T) - fcrinpcan(Q), 

5l(Q) = 5(A) + fclnp^(Q). (7.1) 

Al and 5 l are examples of restricted thermodynamic potentials. 

The canonical probability distribution of the total energy is: 

Pcan(L^; T) = n{N, H, E) eM-E/kT)/Q{N, M, T) 



f dlnpca^n{E]T) 






dE 




1 

T 



1 



(7.2) 



so 



k 



N,V 
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General relations can be obtained for other conjugate variables and ensem- 
bles related by Legendre transforms [170]. 

The above results can be combined to deduce the following equivalent 
conditions related to the finite system analogue of the melting transition 
(Fig. 21): 

• Pcan{E) is bimodal for Tf < T < Tm', 

• there is a loop in (T{E))^, with turning points T/, Tm] 

• there are three energies where {T{E))^ = T for Tf <T < Tm] 

• S{E) [or Inn(E)] must have two inflection points since dS/dE must 
have the same slope at three points; 

• the Landau free energy with Q = E has a double minimum as a 
function of E. 

Physical considerations suggest that the energy density of states obeys 
power laws at high and low energies, so that &^\nQ.{E) / dE"^ is positive 
in both these limits [130]. Two inflection points are therefore guaranteed if 
S{E) exhibits negative curvature. 

Points El and E 2 lying on a common tangent to S{E) have (T{Ei))^ = 
{T{E 2 ))^ = T*. Hence: 

(mW ~ ° ~ ~ 

This is the microcanonical analogue of the Maxwell tie-line construction. 

In bulk systems the “convex intruder” [171] in S{E) is forbidden by the 
van Hove theorem [159] and two phases coexist in contact. For a finite 
system the probability that the system is in a particular form decreases 
smoothly from 1 to 0 for Tf < T < Tm (canonical) (Fig. 21) [172]. For a 
bulk transition the equilibrium constant is a step function, but metastable 
super heated or supercooled phases may still exist. 

Mean field models, such as the van der Waals equation, wrongly predict 
the appearance of loops in the bulk limit because they neglect the phase 
space corresponding to solid and liquid in contact. We do not yet know at 
what size phase separation will begin to be seen in clusters, although for 
(KC 1)32 non- wetted structures have been found in which one part of the 
cluster has the rock-salt structure and the other part an amorphous liquid- 
like structure [173]. The results of a simple model for the total energy 
density of states, written as [163] 




fl{E) — Hsolid(7?) -|- Hiiquid(Tl) -|- 



(7.4) 
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{T{E)), 



Fig. 21. Equivalent conditions for the finite system melting transition. 
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{T{E))^ 




E/N 



Fig. 22. Size dependence of the microcanonical caloric curve for a model that 
admits the phase space corresponding to solidlike and liquidlike regions in contact 



enable the size dependence of the microcanonical caloric curve to be inves- 
tigated (Fig. 22). For small systems ^lynix{E) in phase separated configura- 
tions is negligible, but without this term there would be a loop in the bulk 
limit. 

A loop in the isopotential ensemble caloric curve is a necessary (but not 
sufficient) condition for a loop in (T{E))^ [141]. We can see this by writing 
the canonical probability distribution for the total energy in terms of the 
probability distributions for the potential and kinetic energy: 



[163]. 




E 
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Fig. 23. “Roughened” and reconstructed minima of M 923 (p = 5) [183]. 



This result follows from the the expression for riK(A'K) derived in equa- 
tion (6.5), from which we deduce that 



Pk(Ak;T) = J nK{EK)e~^^/^^dEK 



e : 



'^/2-lp-SK/fcT 



A, 



kI 2-1 EK/kT 



T(k/2) J T(k/2) 



.Q-EK/kT^^ 



(7.5) 



Two groups [174, 175] have used the presence or absence of a loop in the 
microcanonical caloric curve to classify phase transitions as first or second 
order in the Ehrenfest sense [176]. Interfacial entropies have also been calcu- 
lated from the area under the loop following Binder [175,177,178]. In these 
studies the finite size behaviour stems from supercell models of the bulk 
using periodic boundary conditions. For clusters we expect the loop in the 
microcanonical caloric curve to first appear and then disappear as a function 
of size. This behaviour is also likely to be non-monotonic, and hence in- 
ferences of bulk properties based upon such considerations must be treated 
with caution. An alternative approach has been proposed based upon anal- 
ysis of Lee-Yang zeros [179, 180] of the partition function in the complex 
temperature plane [181,182]. However, classification criteria developed for 
bulk matter cannot produce unambiguous results for finite systems. 

Coexistence of different phase-like cluster forms over a range of temper- 
ature (canonical ensemble) or total energy (microcanonical ensemble) can 
be diagnosed using the appropriate restricted thermodynamic potential, as 
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described above [152,169]. However, short time averages of quantities such 
as the kinetic energy have also been used to interrogate dynamical coex- 
istence [153,184-187]. Unfortunately, the choice of a suitable order pa- 
rameter or dynamical property is somewhat arbitrary, and different choices 
need not agree with one another. In particular, dynamical criteria based 
on short time averaging can often reveal coexistence of different phase-like 
forms without the appearance of inflection points in S{E), or any of the 
equivalent thermodynamic features described above. 

Such ambiguities seem inevitable for finite systems, and lead to some 
subjectivity in the description of phenomena such as “surface melting” in 
clusters [183,188,189]. For example, short-time-averaged order parameters 
for LJ 55 produce multimodal distributions [189], but there are no corre- 
sponding features in {T{E))^, etc. [132]. Larger Morse clusters ex- 

hibit an anti-Mackay surface reconstruction before they melt, which only 
produces an inflection in {T{E))^ [132]. Surface disordering due to edge- 
and vertex- migration occurs first, probably analogous to surface roughening 
(Fig. 23). The temperature at which magic numbers disappear experimen- 
tally may correspond to surface reconstruction rather than melting [190]. 

8 Global optimisation 

The global optimisation problem is a subject of intense current interest. 
Applications of obvious economic importance include travelling salesman- 
type problems and the design of microprocessor circuitry. In the domain of 
atoms and molecules, one often wishes to discover the lowest-energy isomer 
or crystal structure for a system with a given composition. Attempting to 
predict the three-dimensional structure of a protein solely from its amino 
acid sequence by computer simulation is a major research effort. 

A random conformational search for the global minimum of a typical 
protein would take an astronomical time. However, it is now generally 
agreed that the search is not random, but instead the PES is biased towards 
the global minimum, resulting in efficient relaxation (Sect. 1.1). Hence 
the amino acid sequences of naturally occurring proteins have presumably 
evolved to fold rapidly into a unique native structure. In treating any non- 
trivial global optimisation problem, the principal difficulty arises from the 
exponentially large number of minima on the PES (Sect. 3.1). 

For the cluster of 55 atoms interacting by a Lennard- Jones (LJ) poten- 
tial (mentioned in Sect. 4.4.4) the number of minima (excluding permuta- 
tional isomers) is at least 10 ^°, but the global minimum is relatively easy 
to locate because of the focusing properties of the energy landscape. Most 
global minima for LJ clusters containing fewer than 100 atoms are based 
upon icosahedral packing. The exceptions (LJ 38 , LJ75_77 and LJgg) serve as 
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Fig. 24. From left to right; global minima of the LJ potential for 38 atoms (trun- 
cated octahedron), 55 atoms (Mackay icosahedron [191]), and 75 atoms (Marks 
decahedron [192]). 

particularly interesting test cases, because the corresponding energy land- 
scapes consist of two families of structures. Two examples, the truncated 
octahedron for LJag and the Marks decahedron [192] for LJ 75 , are shown 
in Figure 24 [193]. At these sizes, the lowest-energy minimum based upon 
icosahedral packing acts as a trap, and is widely separated from the true 
global minimum. Actually, it is quite easy to find the global minima of these 
clusters by seeding the starting geometry with a core of the right type. 

Simulated annealing provided the first generally applicable technique 
for global optimisation [194]. Here the state of the system is followed by 
simulation as the temperature is decreased slowly from a high value, in 
the hope that it will eventually come to rest at the global potential energy 
minimum. Unfortunately, the free energy global minimum may change at 
a temperature where energy barriers are too high for the system to escape 
from a local minimum. 



8.1 Basin-hopping global optimisation 

There is a simple transformation of the energy landscape that does not 
change the global minimum, or the relative energies of any local minima 
[195]: 

A(X) = min{A(X)} (8.1) 

where “min” signifies that an energy minimisation is carried out starting 
from X. The transformed energy, A(X), at any point, X, becomes the 
energy of the structure obtained by minimisation. Each local minimum 
is, therefore, surrounded by a catchment basin [56, 127] of constant energy 
consisting of all the neighbouring geometries from which that particular 
minimum is obtained. The catchment basin transformation removes all 
the transition state regions from the surface and accelerates the dynamics 
because the system can pass between basins all along their boundary. Atoms 
can even pass through each other without encountering prohibitive energy 
barriers. 
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Fig. 25. Illustration of the -E(X) energy landscape transformation. 



The basin-hopping approach therefore transforms the energy landscape 
to a discrete set of energy levels corresponding to the energies of local min- 
ima (Fig. 25), and must be combined with a search strategy. In the “Monte- 
Carlo plus energy minimisation” (MCM) procedure steps are proposed by 
perturbing the current coordinates and carrying out a minimisation from 
the resulting geometry [23,196,197]. A step is accepted if the energy of the 
new minimum, Enew, is lower than the starting point, Eom- If E„ew > Tfoid 
then the step is accepted if exp[(Eoid — E„ew) /kT] is greater than a random 
number drawn from the interval [0,1]. The temperature, T, becomes an 
adjustable parameter. 

The mean number of basin-hopping steps and cpu time required to find 
the global minimum at a fixed temperature are shown in Figure 26 for LJ„ 
up to n = 74. The results are averages over 100 different random starting 
points in each case. A simple basin-hopping program in fortran 77 can 
be downloaded from http://brian.ch.Ccim.ac.uk/software.html and 
collected results for cluster global minima may be found in the Cambridge 
Cluster Database [198]. 




502 



Atomic Clusters and Nanoparticles 




Fig. 26. Statistics for the MCM procedure with LJn up to n = 74. The cpu 
time (seconds) in the lower curve is for a 250 MHz Sun Ultra II processor, and the 
number of basin-hopping steps in the upper plot is dimensionless. The horizontal 
axis is n, the number of atoms. 
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Abstract 

A novel approach to computer simulation study of a finite many-body 
system is considered that allows one to gain a detailed information 
about the system, including its potential energy surface (PES), equi- 
librium properties and kinetics. The approach is based on a confine- 
ment of the molecular dynamics trajectory of the system to the catch- 
ment basins on the PES and goes beyond the limitations imposed by 
the inherent dynamics and statistics of the system that restrict the 
capabilities of the existing methods. The approach is illustrated by 
the solution of typical problems for several finite many-body systems 
(rare-gas and carbon clusters, the molecule of alanin tetrapeptide) . 

1 Introduction 

In these lectures we shall be discussing direct computer simulations of finite 
many-body systems such as clusters and biomolecules. More specifically, 
we will work in terms of classical dynamics and statistics, being interested 
in such things as potential energy surfaces (PESs) of the systems, their 
equilibrium (thermodynamic) properties and kinetics. We will not touch 
the electronic structure, optical and magnetic properties and related things 
that require quantum-mechanical consideration. 

Provided that the interatomic potential for the system is known, all 
information of that sort, which one may want to know about the sys- 
tem, can, in principle, be obtained with the help of the direct computer 
simulation methods: of the method of molecular dynamics (MD) and/or 
Monte-Carlo (MC) [1,2]. With time, as more and more information about 
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interatomic forces comes from the experiment, this way of study of many- 
body systems becomes more and more promising. However certain problems 
are encountered on this way. 

The first and general problem is that the current computers are not 
able to compete with nature in performance. To see how large is the gap, 
consider the frequency of atomic vibrations, which can serve as a charac- 
teristic rate for dynamic phenomena in multiatomic systems. For example, 
for a system which consists of 10 ^ atoms interacting through an analyti- 
cally given semi-empirical potential, one time step of numerical simulation 
on a current one-node computer typically requires the time of the order of 
10“^ s. Therefore, one act of vibrations, usually taken to be of the order 
of 10 ^ time steps, requires about 1 s, whereas in reality it is of the order of 
10“^^ s. It follows that the simulation of a microsecond scale process will 
take 10 ® s, that is about ten days, and the study of this process (collecting 
reasonable statistics, etc.) will require about one year. At the same time, 
many systems of interest are much larger in size, and characteristic time 
scales may expand even to seconds, e.g. in the case of biomolecules. 

It is evident that such a huge gap between the characteristic rates in 
nature and in the simulations can hardly be narrowed to a desirable degree 
by a simple increase of the performance of current computers or by using 
massively parallel machines, at least in the nearest future. A solution of 
the problem should rather be sought on the way of devising an appropriate 
approach to the simulations. Since the proper dynamics of a system is too 
slow from the computational point of view, such an approach should not 
adhere this dynamics, however it should allow a reconstruction of the true 
behaviour of the system from the data obtained in the simulations. This is 
a goal which the confinement simulations pursue and achieve: going beyond 
the limitations imposed by the inherent dynamics and statistics, they reduce 
the computational time to the orders of magnitude and at the same time 
allow one to obtain a detailed information about the behaviour of a system. 

In addition to the previously mentioned general problem there exist two 
more problems that are specific to many-body systems. 

The first one is that a multiatomic system typically possesses a large 
number of mechanically stable atomic configurations. These configurations 
are known as inherent structures of the system and correspond to local min- 
ima on the PES; in the case of clusters and biomolecules they are usually 
called isomers and conformers, respectively. For example, for the cluster 
of 13 atoms interacting through the Lennard- Jones potential (LJ), LJ 13 , 
about 1500 geometrically different isomers are known (Sect. 7.1), and for 
the molecule of alanin tetrapeptide, which consists of 24 atoms, about 1000 
conformers (Sect. 7.4). Moreover, the number of the structures grows with 
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Fig. 1. Schematic picture of a multifunnel potential energy surface. 



system size exponentially [3], reaching astronomically large values even for 
systems of moderate size. For example, for LJ 55 it is estimated as 10^^ [4]. 

A wide variety of inherent structures evidently hinders the simulation 
study of the system. If, say, we are interested in the properties that char- 
acterize the system as a whole (for example, in thermodynamics functions), 
an unreasonably large time may be required to visit the structures whose 
contribution is essential. On the contrary, if our goal is specific structures 
or channels of transitions, another problem arises (which is the inverse side 
of the previous one): since every particular structure is visited rarely, it is 
difficult to gain satisfactory statistics for the quantities of interest. 

The problem is complicated by the fact that the physical properties 
of systems of small and intermediate size vary with size strongly and not 
smoothly [5] . Therefore scaling these properties to finite systems of larger 
size is difficult. 

The other problem is that the PES may be very complex (Fig. 1). Min- 
ima may be positioned on very different height and/or be separated by high 
barriers. Furthermore, they may group together in clusters of minima (so 
called superbasins or funnels), which are separated by even higher barriers. 
In accordance with the Boltzmann distribution, the system predominantly 
samples the lowest part of the PES. Therefore at reasonable temperatures, 
or under limited length of the MD (MC) run, it may be trapped in a super- 
basin. 

The latter two problems add complexity to the simulations, making the 
achievement of ergodic behaviour of the system even more difficult. 

The simplest approach which allows one to conduct a comprehensive 
study of a many-body system, including its PES, equilibrium properties and 
kinetics, is due to Stillinger and Weber [3]. They suggested to supplement 
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Fig. 2. Schematic picture of a 2D potential energy surface, (a) Conventional 
exploration of the surface, and (b) confinement to the catchment basin. The 
dash-dot lines show the boundaries of the catchment basin of interest, the thick 
lines the MD trajectory (in the lower panel, the dashed line segments correspond 
to the parts of the trajectory outside the basin), and the dashed lines show the 
quench paths. 



the MD (or MC) simulations by a diagnostic quenching of the system at 
regular intervals. Each quench leads to a certain minimum on the PES, 
with a variety of quenches leading to the same minimum. The points at 
which the quenches leading to the same minimum are initiated represent a 
catchment basin on the PES that surrounds this minimum. The boundaries 
of the basin are the watershed lines which pass through the saddle points 
connecting the given minimum with the neighboring ones (Fig. 2). The 
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PES can thus be separated into the catchment basins representing different 
inherent structures of the system that are associated with the minima of the 
basins. Correspondingly, executing the quenching of the system along the 
MD (or MC) trajectory, the current points of the trajectory can be related 
to certain catchment basins on the PES, as shown in Figure 2a. In the course 
of exploration of the PES, the system repeatedly visits different catchment 
basins. Therefore, one can collect (separately) the phase points for each 
of the catchment basins. As a result, not only can the PES be surveyed, 
but the equilibrium properties of particular inherent structures and rates 
of the transitions between them can also be calculated. In what follows 
we will refer to this approach to as conventional or S' IF approach. The 
approach is very explicit by the underlying concept and easy to implement, 
but computationally it is not as efficient as desirable. In fact, it encounters 
all the problems that were previously mentioned. 

For the last two decades, a variety of different methods has been sug- 
gested to overcome these problems [6]. The primary concern was to make 
sampling of the PESs more uniform than in the conventional MD and 
MC simulations, and correspondingly, to achieve the ergodic behaviour of 
the system. These methods can roughly be separated into three groups, 
depending on the underlying concept: a modification of the PES by introduc- 
ing an appropriate biasing potential (umbrella potentials [7-9] , multicanoni- 
cal sampling [10,11], a generalized ensemble approach [12,13], and hyperdy- 
namics [14]), biasing the temperature (weighted histogram analysis [15,16] 
and jump-walking [17] methods, and simulated [18] and parallel [19] tem- 
pering), and a modification of the method of surveying the PES (the eigen- 
mode method [20], and the activation-relaxation technique [21-23]). In the 
case of biasing the PES or temperature (the first two groups) the results 
for the true conditions are recovered from the results of the biased simu- 
lations by applying the formulae of statistical mechanics, specifically the 
Boltzmann law. In contrast to the SW approach, the methods listed are 
not of multipurpose character; each of them pursues a specific goal: ei- 
ther the calculation of equilibrium properties (the weighted histogram anal- 
ysis method [15,16], umbrella potentials [7-9], simulated [18] and paral- 
lel [19] tempering, multicanonical sampling [10, 11], generalized ensemble 
approach [12,13]), or the study of kinetics (hyperdynamics [14]), or survey- 
ing a PES (the jump-walking method [17], the eigenmode method [20], and 
the activation-relaxation technique [21-23]). 

Considerably less attention has been paid to the fact that the system, in 
accordance with the Boltzmann statistics, repeatedly visits the catchment 
basins that have been visited. At the same time, if the goal is to survey 
the PES, such events should evidently be avoided because they bear no new 
information about the PES. For this purpose the taboo search method could 
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be employed, which has been elaborated in detail for discrete surfaces [24,25] 
and extended to continuous surfaces [26]. 

Also note that the problem of surveying a PES is closely related to the 
global optimization problem (see, e.g. [27]). In the latter, the goal is the 
global minimum of the surface rather than the surface itself. This brings 
certain specific features in the methods which are used for surveying PESs if 
(when) they are applied for finding the global minimum [27]. There are also 
methods which are designed specifically for a search for the global minimum 
(the simulating annealing [28], and “MC plus energy minimization” proce- 
dure [29] , and conceptually close to them the basin-hopping method [30] , the 
genetic algorithm [31], and the stochastic tunneling [32]). Some of the con- 
cepts that underlie these methods can be useful in application to surveying 
a PES. 

The problem of surveying a PES deserves a more detailed considera- 
tion. A knowledge of the PES is of particular importance, since the PES 
completely defines the behaviour of the system at specific conditions. The 
understanding of how the PES is built provides a deep insight into dynam- 
ics and statistics of the system [1,33,34]. Moreover, the knowledge of the 
PES offers a possibility for writing a master equation that governs system’s 
kinetics [35-40] and thus opens a way for replacing the time consuming 
simulations by a relatively easy solution of the master equation. 

To characterize the PES, one should know, in the first instance, the 
local minima and the saddles that connect these minima [1]. Of primary 
importance are the minima, provided that an indication is given which 
of them are directly connected. With this, finding saddles between the 
minima becomes a straightforward task, which can be fulfilled using one 
of the numerous methods suggested for this purpose, e.g. the method of 
slowest slides [41], the TRAVEL algorithm [42], the ridge method [43] or 
the method of contangency curves [44] (see also Ref. [45], which contains 
a comprehensive bibliography on these and related methods, as well as a 
comparison of typical methods in efficiency). 

One recently developed approach to surveying a PES invokes successive 
walks from one minimum to a neighboring saddle, then from this saddle to a 
next minimum, and so on [20-23] . These walks can be called activation and 
relaxation phase, respectively [21] . For the activation phase one of the uphill 
climbing methods [45] can be used, e.g. the eigenvector-following method 
by Cerjan and Miller [46] in its various modifications, as in [20,23,47,48], 
or a more economical algorithm which does not require evaluation of the 
full Hessian matrix at each step, as in [21,22]. For the relaxation phase one 
of the standard methods, such as the steepest-descent or conjugate gradi- 
ent method, is employed. If the number of minima is not too large, all, or 
practically all, minima on the PES can be located by a direct walk from one 
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minimum to another [20,47,48]. For systems of larger size, where the num- 
ber of minima becomes too large, just a statistical search for the minima is 
possible. For this purpose a transition between any two nearby minima is 
taken as an elementary Metropolis step [49], that is the move from one min- 
imum to another is accepted with a probability of min[l, exp(— Aif/fce’F)]) 
where /cb is the Boltzmann constant, T is the temperature, and Aif is the 
change in energy associated with this move, which can be either the en- 
ergy difference between the minima or the height of the barrier between the 
minima [21,23]. 

In these lectures we shall consider a new approach to the simulation 
study of a finite many-body system. Similar to the SW approach, this 
approach is of a multipurpose character, allowing one to survey a PES and 
to study the equilibrium properties and kinetics of a system. At the same 
time the approach is free from many shortcomings inherent to the SW and 
other approaches. 

2 Key points and advantages of the confinement simulations: General 

remarks 

Before starting a detailed discussion of the approach, let us consider its key 
points, using for this purpose model PESs depicted in Figures 1, 2. 

The basic element of the approach is the confinement of the MD tra- 
jectory of the system to the catchment basins on the PES [50-52] (in what 
follows, instead of “the MD trajectory of the system” and/or “the represen- 
tative point of the system” we shall say for brevity “the system” ; it should 
not lead to confusion) . To initiate simulations, the system is placed into one 
of the catchment basins, and the MD (MC) run is begun (Fig. 2b). As in the 
SW method, the system is quenched at regular intervals in order to check if 
it is still in a given basin or has left it for another, neighboring basin. If left, 
then depending upon the strategy chosen to survey the PES (see below), 
the system is either allowed to pass into the basin in which it was found or 
returned into the given basin for further exploration of this basin (Fig. 2b). 
To return the system into the basin, a certain procedure of the reversal of 
the trajectory is employed. In contrast to the conventional dynamics, the 
system can now be kept in a current basin for an arbitrary long time, and 
thus all characteristics of the structure corresponding to this basin can be 
calculated as accurately as desirable. It is essential that though the system 
does not leave the basin for a time longer than the interval between two 
subsequent quenches (for brevity, the quenching interval), all neighboring 
basins can be determined and all attempts to pass into these basins can be 
recorded. Therefore not only can the thermodynamics (equilibrium) func- 
tions for a given basin be calculated but also the rates of transitions into 
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the neighboring basins, and thus the kinetics of the system. Having the 
option of either allowing the system to pass into a certain basin or not, 
one can explore just a part of PES that is of interest. Also note that the 
approach does not require any prior information about the PES landscape: 
being placed at an arbitrary chosen point on the PES, the system, basin by 
basin, can explore all the PES, or a desirable part of it. 

More can be learned from the consideration of Figure 1, which depicts 
a two-funnel model PES. Assume for simplicity that all basins are identical 
except for their minimum energies; in particular, they have the same mean 
life time, r, and the same probability, p, for the system to go to the higher 
neighboring basin (the probability to go in the inverse direction is then 
1 — p). Assume also that p <C 1, in which case the conventional dynamics 
becomes inefficient. According to the above definitions, the mean time for 
the system to make one step to the higher energy basin is r/p, and the mean 
time to go to the neighboring lower energy basins is r/(l — p) ~ r. Place 
the system at a bottom of one of the funnels. Then, if the conventional 
dynamics is used, in which the system freely goes up and down among the 
basins, the mean time for the system to reach N-th basin from the bottom 
of the funnel is 

icnv = r/p^. 

In the case of the confinement simulations, we can forbid the system to 
return to a lower energy basin, since it has been sampled. Therefore, the 
mean time required to reach the N-th basin is written as 

tcnf = Nt/p 

that is now it increases with the number of basins N linearly and not expo- 
nentially, as before. This reveals an essential difference between the conven- 
tional and confinement dynamics. In particular, when successively confined 
to higher and higher positioned basins, the system much easily overcomes 
the interfunnel barriers. To see how considerable is the gain, set, for ex- 
ample, N = 3 and p = 0.01, which yields tcnv/^cnf = p^~^ /N = 3333. 
Note that for this process the use of constant temperature MD (canonical 
ensemble) is essential, since in the case of constant energy MD (microcanon- 
ical ensemble) the higher the system climbed in potential energy, the lower 
would be its kinetic energy (temperature) and the slower the rates of tran- 
sitions. For the constant temperature MD, in contrast, the temperature 
will be the same in each of the basins, regardless how high the basin is 
positioned. 

This process of basin-by-basin climbing is similar to the process of mul- 
tistep rising a boat in a channel through the locks. The analogy between 
these processes becomes even closer if the system is successively confined 
not to individual basins but to pairs of neighboring basins, in which the 
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upper basin serves as the lower one for the next step of the lockage; then 
the system can freely pass between two connected basins, exactly as a boat 
in the lock when the water in different parts of the lock levels off. 



3 Methods for generating phase trajectories 

To implement the confinement technique, first of all we need a method 
to generate a phase trajectory of the system, i.e. a sequence of points in 
phase space. In general, both MD and MC methods are applicable for this 
purpose. A difference is that the MD gives this sequence as a function of 
time, while the MC simply generates a chain of random points obeying an 
equilibrium statistical distribution, canonical [49] or microcanonical [19], 
depending on the conditions under consideration [53]. Correspondingly, the 
MD method can be used to simulate kinetics of a system directly, whereas 
the MC requires an additional hypothesis to relate the intervals between 
the points to physical time. Therefore we shall consider the MD method. 

Similar to the MC methods, there are two types of the MD methods that 
are used to simulate constant energy and constant temperature conditions, 
respectively. 



3.1 Conventional molecular dynamics 



The conventional MD involves a numerical integration of the coupled 
Newton equations 



d^ri dU 



(3.1) 



where t is the time, U the potential energy of the system, the coordinates 
of z-th atom, and m is the atomic mass (for simplicity, we assume all atoms 
to be identical). 

If no restraints are imposed, these equations describe dynamics of a 
free system. Correspondingly, along a phase trajectory there are conserved 
the total energy E = Pi^/2m + U, linear P = angular 

L = X)r=i ^ P* momenta of the system, where Pi = rndvi/dt is the 
momentum of z-th atom, and n is the number of atoms in the system. 
Issuing phase trajectories at different initial conditions which satisfy these 
integrals of motion, i.e. starting at different points of phase space that 
are located at the intersection of the hypersurfaces E = Eq, P = Pq and 
L = Lq, one can form the microcanonical ensemble of the systems for specific 
values of Aq, Po and Lq. 

This dynamics is also known as Newtonian or constant energy MD. 
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3.2 Stochastic molecular dynamics 



The simplest version of stochastic MD is based on a numerical integration 
of the coupled Langevin’s equations 



TO 





dr. 



Mt) 



(3.2) 



where a is viscosity, and ^i(t) are random (Langevin’s) forces due to ther- 
mal fluctuations in the system at given temperature T. These forces have 
mean zero and are related to the viscosity by the fluctuation-dissipation 
theorem: 

(4>j(t)) = 0, {(^i{t)(^i, {t + r)) = 2akBTS^i'Sjf6{T) (3.3) 



where /cb is the Boltzmann constant, the upper index at d> stands for a 
component of the vector {j,j' = 1,2,3), the angular brackets denote an 
ensemble average, and Skw and 5{t) are the Kronecker and Dirac deltas, 
respectively. This MD simulates constant temperature conditions, with a 
specific value of the temperature being a result of the balance between 
Langevin’s forces and viscous friction. For a free system none of E, P 
and L is conserved along the phase trajectory. Issuing phase trajectories 
at different points of phase space, one can form canonical ensemble of the 
systems that corresponds to a given temperature T . 

This dynamics is also known as the Langevin or constant temperature 
MD. 

The Langevin MD requires an additional comment. This dynamics is 
often used to simulate in an economic way a subsystem which is a part 
of the system of considerably larger size, with the rest of the system being 
considered as a heat bath to which the subsystem is coupled. Some examples 
of this are given by a cluster in vapor surroundings, a biomolecule in solvent, 
or a cluster on the solid surface. Langevin forces however do not reproduce 
actual forces coming from the surroundings. Equations (3.2, 3.3) rather 
imply that the system is embedded into a structureless fluid which equally 
(though in a random way) affects every atom, regardless either this atom 
is, say, within the system or at its surface. It can be shown (see, e.g. 
[54]) that at t ^ oo the Langevin MD leads to the canonical (Boltzmann) 
distribution at the given temperature, and it can thus be used to calculate 
the equilibrium (thermodynamic) properties of a system. As for kinetics, no 
well justified estimates are available, except for some reasonable qualitative 
considerations [54, 55] . Therefore in most cases kinetics calculated on the 
basis of the Langevin equations can pretend just to a qualitative description 
of the process. 

A more realistic description of the behaviour of a system can be achieved 
on the basis of the generalized (integral-differential) Langevin equation. 
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which does not assume a white noise for random forces (3.3) and accord- 
ingly a constant viscosity a [56] . A problem of application of the generalized 
Langevin MD to a specific system is that one has to know the kernel re- 
sponsible for a retarded effect of the frictional forces. This kernel is however 
specific for a specific system and thus, to be obtained, requires additional 
information about the system (either from experiment or more accurate 
dynamic calculations, see, e.g. [57]). 

Another, recently suggested generalization of the Langevin dynamics 
makes the heat bath partly structural, - it endows particles of the heat 
bath with a finite mass [58]. 

4 Identification of atomic structures 

The next basic component of the confinement technique is the procedure 
of identification of the catchment basin in which the system is currently 
located. 



4 . 1 Quenching procedure 



Each catchment basin can be characterized by its minimum. Therefore we 
should find a way of how to relate a current point in the MD trajectory 
to the corresponding minimum. One possibility is to quench the system. 
In the case of Newtonian MD one can remove the kinetic energy of the 
system and then allow the system to relax, repeating this procedure until 
the kinetic energy comes to zero. In the case of the Langevin MD one can 
simply set and keep temperature zero. In both cases the system should 
come to the minimum of the basin. A more preferable way, however, is 
to exclude dynamic effects from the consideration completely and to work 
solely in terms of the PES. 

The simplest and most commonly used method of this sort is the steepest 
descent method, which involves a numerical integration of the equations 



dr* dU 

dt dri 



(4.1) 



with the atomic configuration corresponding to the current point of the 
MD trajectory being taken for the starting point. These equations describe 
an overdampted motion of atoms in the system at T = 0. They can be 
obtained from (3.2, 3.3), if, assuming the viscosity to be large, one neglects 
the inertia of the atoms (the first term in the left-hand side of (3.2)) and 
set T = 0 in (3.3). For this reason, the steepest-descent procedure is often 
referred to as quenching too. 

A certain disadvantage of the steepest-descent method is its slow con- 
vergence to the minimum: the closer the system to the minimum, the lower 
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is dU /dvi and the slower approaches the system the minimum. This defect 
can be remedied in part by choosing the time step for the integration of 
(4.1) to be proportional to \dU /dvi\~^ . 

A more radical way to avoid slow convergence to the minimum is to 
use the Newton-Raphson algorithm in the final stage of the search, where 
\dU /dvi\ becomes sufficiently small. This algorithm is based on quadratic 
approximation to the surface, and in the range where such approximation 
is valid it enables the attainment of the minimum by a single step 



AR= -H-i 



dU 

9R 



where R = (ri,r2, ..., r„), AR = (Ari, Ar2, ..., Ar„) is the space increment, 
and H is the Hessian (z. e. the matrix of the second order derivatives oiU). 
Therefore when the system is close to the minimum, and hence the PES is 
close to parabolic, just a few Newton-Raphson steps are usually required to 
reach the minimum. 



4.2 Characterization of a minimum 

As soon as the minimum is found, it should be characterized for the purpose 
of subsequent comparison with the other minima involved. In the PES terms 
each of the minima can be characterized by its energy C/min and spectrum 
of the eigenvalues of the Hessian Hi,..., H^n, the latter define the curvature 
of the basin at its bottom. For nonlinear (linear) atomic configuration 
six (five) of the eigenvalues - three for translational and three (two) for 
rotational degrees of freedom - are equal to zero, and the others, 3n — 6 
(3n — 5), corresponding to vibrations, are positive. 

Alternatively, since every minimum is associated with a certain inherent 
structure (isomer, conformer), physical characteristics of these structures 
can be used to distinguish the minima, t/min stands then for the energy of 
the structure, and instead of the Hessian eigenvalues Hi the normal mode 
frequencies tOi are usually used, which are related to the Hi as uii = \/Hi. 

For geometrically different structures, it is usually sufficient to compare 
the structures only for their minimum energies. To make sure that no 
occasional coincidence in energy happened, one can compare the spectra of 
the frequencies as well, even not the spectra themselves but the geometrical 
means of the frequencies tZJ = (nf=i ^ j)^/^, where N = 3n — 6 (3n — 5). All 
other characteristics are rarely used for this purpose, rather they may be of 
interest from physical point of view (the point group and its order h, the 
principal momenta of inertia lai {a = 1, 2, 3), which characterize rotational 
properties of the structure, etc.). 

In certain cases, it may also be necessary to distinguish between per- 
mutational structures, i.e. the structures that are geometrically identical 
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but differ by the order in which the atoms are labeled. This is achieved by 
calculating the distance in configuration space 

after superposition of the structures by the rotation of one of the struc- 
tures around its center-of-mass; here a and [3 label the structures, and 
stand for atomic coordinates counted from the corresponding center- 
of-mass. 

5 Confinement procedures 

A general purpose of the confinement of the system to a catchment basin is 
to keep the behaviour of the system under control: if the system attempts 
to leave the basin, either to allow it to do this or to return the system into 
the basin for further sampling of the basin. In the former case the MD 
trajectory of the system is simply continued in a usual fashion, but in the 
latter case a new trajectory should be initiated in the basin. Our goal then 
is to specify the initial conditions (atomic coordinates and velocities) for 
this new trajectory so that it would reproduce a “proper” behaviour of the 
system in the basin. More rigorously, the phase trajectories obtained at the 
confinement of the system to a basin should be consistent with the trajecto- 
ries, in the part related to the basin, that are obtained in the conventional 
simulations, i.e. when the system freely explores the PES, entering and 
leaving basins. Correspondingly, for a correct reproduction of the equilib- 
rium properties and kinetics we need a statistically representative set of 
such trajectories for each catchment basin involved. 

Let the system leave one of the basins and consider how a new trajec- 
tory can be initiated in it. We shall discuss two methods to do this. The 
sequence of operations in both cases is the same: the system is quenched at 
regular intervals, Tquench, along the MD trajectory, and the current quenched 
structure is compared with that corresponding to the minimum of the given 
basin. If the system is found in a neighboring basin, a new trajectory is 
initiated in the given basin. 

5.1 Reversal of the trajectory at the boundary of the basin. Microcanonical 
ensemble 

Conceptually most simple method is a reversal of the trajectory at the 
boundary of the basin (Fig. 3a). In this case, first of all, the boundary has 
to be accurately located. For this, when the system is found outside the 
basin, the quenching interval Tquench is reduced by half, and the trajectory 
is issued again from the last point where, at quenching, the system was 
found in the given basin. This procedure is repeated until the boundary is 
fixed with an accuracy of one time step of numerical integration of equations 
(3.1). 
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Fig. 3. Initiating a new MD trajectory in the catchment basin (topographical 
view of the 2D potential energy surface). Solid lines show the MD trajectory, 
the dash-dot lines the boundaries of the catchment basins of interest, and the 
dashed lines the quench paths. Figures 0 and 1 indicate the minima of the given 
and neighboring basins, respectively, (a) Reversal of the MD trajectory at the 
boundary of the basin, and (b) initiating the MD trajectory at the point of the 
last quenching in the basin. Solid arrows indicate the directions of new trajectories 
at the points of their initiating, and the dashed arrow in the upper panel shows 
the direction of the reversed trajectory before scattering of atomic velocities. 



The next step is the reversal of the trajectory. Different methods to 
perform it can be suggested. They may vary from a “pure” reversal, when 
the velocities of all atoms change to the opposite ones, to a reflection of 
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the trajectory from the boundary of the basin, which may, in turn, be 
specular, diffusive, or specular-diffusive. Since our concern is to obtain a 
statistically representative set of trajectory, it is essential that the reversed 
trajectory would deviate from the forward one, along which the system 
came to the boundary, as early as possible. The “pure” reversal is the most 
simple procedure, and, in principle, it furnishes the desirable effect due 
to the exponential divergency of phase trajectories [59]. Indeed, since the 
behaviour of the system beside the boundary is essentially nonlinear, one 
can expect that the reversed trajectory will deviate from the forward one in 
a short time. However, one can also expect that the variety of phase points 
generated by such trajectories beside the boundary will be poor as compared 
to that in the internal part of the basin. In this respect, the reflection of 
the trajectory looks much more promising, particularly if it is a diffusive 
reflection. However, to perform it, the orientation of the boundary should 
be known, the determination of which is a complex and time consuming 
procedure in the case of multidimensional surfaces. 

To resolve this problem, there was taken a compromise between the 
above variants of the reversal: the velocities of all atoms at the point of 
the reversal, first, changed their signs, and second, in order to amplify the 
divergence of phase trajectories, they were subjected to a uniform random 
scattering. Specifically, the velocities were scattered within a f3 part of 
their mean value. The results of the simulations were found just slightly 
dependent upon a specific value of [3 (when it varied from 0.2 to 1.0), and 
(3 = 0.4 was used as a regular value. The act of the reversal was followed 
by a correction of the integrals of motion of the system: the total energy E, 
linear P and angular L momenta of the system after the reversal were fitted 
to those before the reversal by shifting and rescaling atomic velocities. 

To test this procedure, there were conducted confinement and conven- 
tional (SW) simulations (z. e. in the latter case the system was also diag- 
nostically quenched at regular intervals as in the confinement simulations). 
For this, there was taken 13-atom cluster with atoms interacting through a 
pairwise additive Lennard- Jones (LJ) potential 

n 

t/ = [{cr/njY^ - {cr/njf]. 

2=1 J <2 

Here rij is the distance between z-th and j-th atoms, and e and a are, 
respectively, the characteristic energy and length, which are specific for a 
specific system (for example, for Ar e = 119.8 K and cr = 3.904 x 10“® cm), 
e and a are usually taken as the corresponding units of the LJ potential. 
Together with the atomic mass, to, they constitute so called LJ systems of 
units, in which, in particular, the time unit is a/ TO tr^/e (2.15x10“^^ s for 
Ar). In what follows, when giving numerical values for LJ clusters, we will 
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use this system of units. The cluster did not execute both overall translation 
and rotation, specifically there was P = L = R = 0, where R = mri/M 
is the center-of-mass position (M = nm is the total mass of the cluster). 

The system of Newtonian equation (3.1) was numerically integrated us- 
ing the algorithm by Schofield [60], the time step r = 0.0025 (5.3x10“^® s 
for Ar). Quenching was executed at intervals of 64 time steps, with each 
quench being continued until the eigenvalues of the Hessian that correspond 
to the translational and rotational degrees of freedom reached the value of 
10“®. To reveal possible dissociation (evaporation) of the cluster, the fol- 
lowing criterion was used: the cluster was considered to be in a dissociated 
state if it divided into fragments (possibly consisting of a single atom) which 
were spaced from each other by the distance of 2.6 or larger. If dissociation 
occurred in the course of the conventional simulations, the MD trajectory 
was terminated, and a new one was started. In the case of the confinement 
simulations dissociation was automatically prevented because the system 
was confined to the basins corresponding to bound-state structures. 

To compare the confinement and conventional simulations, we will con- 
sider two characteristics, which are related to the equilibrium properties 
and kinetics, respectively (Figs. 4-6). 

Let us start with the equilibrium properties. In the confinement simu- 
lations the system was confined to the catchment basin corresponding to a 
specific isomer: the atoms are labeled in a certain order, no inversions are 
expected, but various orientations of the cluster are possible (since the set of 
orientations is continuous, such basin is a sort of closed valley rather than the 
true basin). Then at the conditions under consideration (P = L = R = 0) 
the phase volume associated with this isomer can be written as 

G{E) = [ dr(3”)dp(3")<5(R)(5(P),5(L)0(A - H) (5.1) 



where dr^^”) and dp^^”) are 3n-dimensional elements of the volume in con- 
figuration and momentum space, respectively, S the delta function, 0 the 
step function, H = pj/2m + U the cluster Hamiltonian, and the integral 

with respect to atomic coordinates is taken over the catchment basin under 
consideration. To pass to the corresponding dimensionless quantity, the to- 
tal number of states in quasi-classical approximation, the factor 1/(27 t?i)'^ 
should be assigned to the integral, where fi is the Planck constant, and 
N = in — & \s the number of vibrational degrees of freedom in the cluster 
(the atomic configurations are supposed to be non-linear). 

G{E) completely defines all equilibrium (thermodynamic) properties of 
the system within the given basin, for example, the entropy is S{E) = 
k-Q InG(A). Therefore a good reproduction of the dependence G = G{E) in 
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Fig. 4. Distribution of the microstates over the potential energy of the system in a 
basin corresponding to the ground-state isomer of LJ 13 ; microcanonical ensemble, 
E — 15. Crosses represent the conventional simulations, and the triangles show 
the results of the conhnement simulations: solid triangles are for the reversal of 
the MD trajectories at the boundary of the basin, and the empty triangles for 
initiating the trajectory at the point of the last quenching in the basin. The 
energy is counted from the ground-state isomer energy, Cmin = —44.3268. 



the confinement simulations guarantees a correct calculation of the equilib- 
rium properties. A comparison of this sort will be presented in Section 6.1. 



Here we consider another characteristic, which is closely related to G{E) 
but more illustrative for our current purposes. Integrating over momenta 
in (5.1) (see Ref. [61] for how it can be done), we obtain 

G(E) = c/dr<^->yR) ^ eiE - U) (5.2) 

where C = (a = 1, 2, 3) are the principal momenta 

of inertia of the current atomic configuration, and T is the gamma function. 
Correspondingly, the density of states p{E) = dG/dE is 

«E) = c/*<»”>5(R)J|^-F^^e(E-t/). (5.3) 



Introducing the configuration density of states as 



n{u') = G 



dr(3") 5(R) S{U - U') 

{hhhYGv{N/2) 



Equation (5.3) can be rewritten in the form 



p{E) = J n{U) {E - [/)^/2-i - U). 



(5.4) 
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Then the integrand in this latter equation will present the distribution of 
the microstates over potential energy U at given E. The thermodynamic 
functions (at this value of E) can be calculated as certain moments of this 
distribution, in particular, the density of states p{E) as the moment of zero 
order (with the phase volume then obtained as G{E) = p{E)dE), the 
mean potential and kinetic energy as the corresponding moments of the first 
order, etc. 

Figure 4 compares the distributions of the microstates over the potential 
energy of the system in a basin corresponding to the ground-state isomer 
(C^min = —44.3268). The total energy if = 15 corresponds to the liquid-like 
region (see Fig. 10a). 

To characterize kinetic properties, we shall consider the distribution of 
life times of the system in a basin. Whereas the distribution of the mi- 
crostates (Fig. 4) tells us nothing about how these microstates follow each 
other in time, this characteristic indicates, to a certain degree, either the 
microstates are met in the proper time order (within a single trajectory) or 
not. In the conventional simulations the life time was calculated as the in- 
terval between two subsequent acts of entering and leaving the basin, and at 
confinement of the system, as the interval between two subsequent reversals 
of the trajectory. 

Figure 5a compares the distributions of life times in the same catchment 
basin for LJ13 as in Figure 4. In general, the data are in good agreement. 
A small exception are the smallest times, which correspond to the events 
when the system does not go far away from the boundary of the basin. For 
an insight into this disagreement. Figure 5b separates life times into two 
groups: one group for the events when the system leaves the given basin 
for the basin from which it came, and another for the events when the 
system leaves this basin for another basin. It is seen that though the actual 
dynamics of the system at small times is different for these two groups of 
events, the confinement technique likely offers some similar dynamics. The 
latter dynamics seems to be satisfactory for the second group of the events, 
when at small times the system simply crosses the given basin (such events 
are characteristic of the regions beside the saddle points of high order). 
However, for the first group of the events, when the system returns to the 
basin from which it came, a simple reversal of the trajectory at the boundary 
of the basin is evidently not able to reproduce the dynamics of the system 
at small times. 

This difference in dynamics however weakly affects the rates of the tran- 
sitions between the basins, which are the basic quantities in kinetics (see 
Sect. 7.2). Figure 6 compares the conventional and confinement simulations 
for the rates of the transitions from the ground-state isomer to its per- 
mutational copies and to several low lying excited-state isomers, for which 
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L IFE TIM E , in timesteps 

Fig. 5. Distribution of life times of the system in the basin corresponding to the 
ground-state isomer of LJ 13 ; microcanonical ensemble, E = 15. (a) The complete 
set of the events, (b) All events are separated in two groups: the system leaves 
the given basin for the basin from which it came (the lower set of the data) and 
for another basin (the upper set of the data). The notations are the same as in 
Figure 4. 

reasonably good statistics was achieved in the conventional simulations. In 
both cases the rates were calculated as the mean numbers of the correspond- 
ing events per unit time; the only difference was that in the conventional 
simulations there were counted the actual acts of the transitions, whereas in 
the confinement simulations the attempts of the system to leave the basin. 
These rates indicate, to a certain degree, how the acts of the transitions are 
distributed among different channels. 

Note that the distributions of life times in Figure 5, including those ob- 
tained in the conventional simulations, deviate from the exponential depen- 
dence characteristic of the Poisson decay process (Sect. 7.2, Eq. (7.11)). 
There are two reasons for this. First, these distributions accumulate the 
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Fig. 6. Rates of the transitions from the ground-state isomer of LJ13; micro- 
canonical ensemble, E = 15. The notations are the same as in Figures 4 and 5. 



events for many channels, which are characterized by different mean life 
times (these times are the reciprocal values of the transition rates shown in 
Fig. 6), and second, the process is not pure stochastic because of a compli- 
cated dynamics of the system at small life times [62,63]. 

Conceptually very explicit, this procedure of initiating new trajectories 
in catchment basins suffers from certain disadvantages. First, an accurate 
location of basin’s boundary requires many quenches, which are very time 
consuming. Second, as our experience shows, the reversal of the trajectory 
may happen to be unsuccessful. At low and moderate energies, until saddles 
of high order are involved, the system successfully returned to the basin ev- 
ery time when the reversal was executed. However, as the energy increased, 
not always every act of the reversal furnished the desired result: after a 
few time steps within the true basin the system attempted to leave it again 
(which typically happened when the system was in a region where the num- 
ber of imaginary frequencies was as large as ten or higher). As a result, a 
new act of the reversal had to be executed in a short time. Furthermore, at 
even higher energies, when the system had a pronounced tendency towards 
a decay (evaporation), it happened that the phase trajectory “cycled”: the 
same sequence of the basins (but not of the same phase points!) repeated 
many times. 

5.2 Initiating the trajectory at the point of the last quenching within the basin. 

Microcanonical and canonical ensembles 

From computational point of view, a more promising way is to start a new 
trajectory immediately at the point where the system was found in the true 
basin last time. In this case we avoid the time consuming procedure of 
locating the boundary of the basin, and, presumably, the effect of “cycling” 
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Fig. 7. Distribution of the microstates over the total energy of the system in 
a basin corresponding to the ground-state isomer of LJ 13 ; canonical ensemble, 
T — 0.35. Crosses represent the conventional simulations, and the triangles show 
the results of the confinement simulations with the MD trajectories initiated at 
the points of the last quenching in the basin. The energy is counted from the 
ground-state isomer energy, f/min = —44.3268. 



the trajectory. However, instead, new issues arise: if these truncated trajec- 
tories are able to reproduce a proper behaviour of the system in the basin, 
and how atomic velocities should be chosen to start these trajectories. The 
simplest choice for atomic velocities is to take them as random values obey- 
ing the Maxwell distribution, with the kinetic energy of the system being 
equal to that at the point of the last quenching in the basin. It is under- 
stood that in the case of Newtonian dynamics (microcanonical ensemble) 
the atomic velocities at the starting point are corrected so that they satisfy 
the integrals of motion, exactly as in the preceding section. 

This procedure of initiating the trajectory in the case of microcanoni- 
cal ensemble is illustrated by the corresponding data in Figures 4-6. The 
quenching interval Tquench = 50 time steps, and its variation did not affect 
the results noticeably. For the life time (Fig. 5) there is taken the interval 
between the first and last quenching at which the system was found in the 
given basin. This time evidently differs from the true life time, presumably 
by the order of Tquench) and this difference may be seen in Figure 5 at the 
times comparable with Tquench- 

Figures 7-9 compare the results of the conventional and confinement sim- 
ulations in the case of Langevin’s dynamics {canonical ensemble). 
Equations (3.2, 3.3) were integrated using the numerical algorithm by 
Biswas and Hamann [64]; the time step is equal to 0.01, and the friction 
coefficient to 3.0 (for Ar, 2.15 x 10“^^ s and 1.4 x 10^^ s“^, respectively). 
For testing, as before, a basin for the ground-state isomer of LJ 13 is taken. 
The temperature, which is equal to 0.35 in the LJ units, also corresponds to 
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Fig. 8. Distribution of life times in the basin corresponding to the ground-state 
isomer of LJ13; canonical ensemble, T = 0.35. The complete set of the events (as 
in Fig. 5a). The notations are the same as in Figure 7. 
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Fig. 9. Rates of the transitions from the ground-state isomer of LJ 13 ; canonical 
ensemble, T = 0.35. The notations are the same as in Figures 7 and 8 . 



the liquid-like region (note that microcanonical and canonical caloric curves 
for a system of finite size are characteristically different [65] , in particular, 
compare Figs. 10a and 15). To see how the equilibrium properties are re- 
produced, the distribution of the microstates over the total energies of the 
system is considered. According to the relation 

Z{T) = j dE p{E) expi-E/ksT) 

where Z(T) is the partition function, the role of this distribution in canon- 
ical ensemble is similar to the role which the distribution over the poten- 
tial energy played in microcanonical ensemble (compare this equation with 
(5.4)). 




S.F. Chekmarev: Confinement Technique 



533 



6 Confinement to a selected catchment area. Some applications 

For a catchment area we understand here either a catchment basin or a 
set of connected basins. In all applications microcanonical ensembles of the 
systems are considered, and, correspondingly, Newtonian MD is employed 
for the simulations. New trajectories in catchment basins are initiated by 
the reversal of the trajectories at the boundaries of the basins, as explained 
in Section 5.1. 



6.1 Fractional caloric curves and densities of states of the isomers [51, 52] 

The caloric curve is a relation between the total energy of the system and 
its temperature. For microcanonical ensemble, the total energy has a spe- 
cific value, and the temperature is defined through the mean kinetic energy 
(ifkin) according to the relation (ifkin) = (N/2) kuT; here N is the num- 
ber of active degrees of freedom, in particular, if n-atom system does not 
execute overall translation and rotation and its configuration is nonlinear, 
fV = 3n — 6. Instead of the temperature, the mean kinetic energy itself is 
often considered. Then the caloric curve is a dependence of (ifkin) on E. In 
simulations (ifkin) is calculated as 

^ N n 

{Ekin) = (6.1) 

i=l i=l 

where j labels the points taken into account, and N is the total number of 
the points. If these points are related to an specific isomer, a (fractional) 
caloric curve for this isomer is obtained, and if to all isomers accessible at 
given E, then the total caloric curve. 

Figure 10a presents caloric curves for LJ13. The cluster executes no 
overall translation and rotation. Numerical values are given in the LJ units. 
There are shown the caloric curve for the ground-state isomer, those for 
several low-lying excited-state isomers, and also the total caloric curve. The 
total caloric curve was calculated in a usual fashion, allowing the system to 
explore the PES freely. To calculate caloric curve for a specific isomer, the 
system was confined to one of the catchment basins corresponding to this 
isomer. Characteristics of the isomers are given in Table 1 (in this section 
we use the linear frequency of vibrations v, which is related to the angular 
frequency ui as v = u;/27r). Figure 10a clearly indicates that the caloric 
curves for excited-state isomers are very similar in appearance, and they 
are remarkably different from the caloric curve for the ground-state isomer. 
Moreover, if one counts the total energy for each of the isomers from its 
minimum energy, these curves follow each other with a good accuracy, even 
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Fig. 10. LJi 3 , microcanonical ensemble: isomers are numbered according to 
Table 1, and the total energy is counted from the ground-state isomer energy 
f/min = —44.3268. (a) Caloric curves for individual isomers and the total caloric 
curve (solid circles), (b) Absolute fractional densities of states and the total den- 
sity of states (the dashed line), (c) Relative densities of states (lines) in comparison 
with the relative residence times (symbols). 




S.F. Chekmarev: Confinement Technique 



535 



Table 1 . Characteristics of isomers for LJ13. The isomers are numbered according 
to their order in the isomer energy spectrum, 0 labels the ground-state isomer, 
f/min.r IS the minimum energy of r-th isomer, Vr the geometrical mean of the 
normal frequencies, and hr the point group order. 



Isomer Cmin.r Vr hr 



0 


-44.326801 


1.750316 


120 


1 


-41.471979 


1.636509 


2 


2 


-41.444597 


1.635673 


2 


3 


-41.394398 


1.635996 


2 


4 


-40.758513 


1.638231 


4 


8 


-40.670170 


1.632131 


1 


138 


-38.975810 


1.553827 


6 



though the inherent structures of these isomers are quite different (compare, 
in particular, the point group order hr, Tab. 1). 

These caloric curves make it possible to calculate the absolute densities 
of states, both for the system as a whole and for specific isomers. 

The total phase volume for the system is defined by equation (5.1) except 
that the integral is taken over all points in configuration space. Following 
Stillinger and Weber [3], break the integral into the parts corresponding to 
geometrically different inherent structures (isomers). Each such structure is 
represented by a set of 2n ! /hr geometrically identical copies, where 2n! is the 
order of the complete nuclear permutation and inversion group (CNPI) [66] , 
and hr is the order of the point group of r-th structure. In turn, to each of 
the copies there corresponds a certain catchment basin, how it was defined 
when we wrote equation (5.1) in Section 5.1. Therefore the phase volume 
associated with r-th isomer can be written as 

Gr{E) = {2n\/hr)G{E) (6.2) 

with G{E) given by equation (5.2), and the total phase volume can be 
calculated as 

G{E)=Y,Gr{E). 

r 

Correspondingly, the density of states for r-th isomer and the total density 
of states are calculated as 

Pr{E)=dGr/dE and p{E) = Pr{E). 



As one can find from (5.2) 



dlna/dE = fV/(2(F;ki„,.)) 



(6.3) 
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where (Akm.r) is the mean kinetic energy for r-th isomer. This equation 
is simply an analogue of the thermodynamic equation TdS = dE, with 
S = feelnGr and T = 2{Ey^in,r) /NU b being the entropy and temperature, 
respectively. A similar equation is valid for the whole system. 

With the characteristics of the isomers and the caloric curves being 
known, equation (6.3) can be numerically integrated to give Gr{E) or G{E), 
depending on which of the caloric curves, fractional or total, is used. To 
find the arbitrary factor in the integral, the solution can be matched in the 
lowest energy limit to the harmonic solution 



G-hr(E) 



87r"(A-f/n,in,,)^ 

r(A^ + i)Ff 



where the factor accounts for various orientations of the cluster. When 
calculating G{E), the harmonic solution for the ground state isomer should 
be used. To circumvent the difficulties which one meets at attempting a di- 
rect matching, the right-hand part of (6.3) was divided into two parts: the 
harmonic one, which is N/E in accordance with the equipartition principle, 
and the rest N [l/(2(ifkin,r)) — 1/A], which is an anharmonic correction. 
The first part is integrated analytically, that provides an automatic match- 
ing to the harmonic solution. Figure 10b shows the densities of states so 
obtained from the caloric curves of Figure 10a. 

Calculating the density of states from the caloric curves found by con- 
finement of the system to the catchment basins, we dealt just with local 
regions of the PES. This offers a unique opportunity to address the funda- 
mentals of statistical mechanics - the ergodic hypothesis and the hypothesis 
of equal a priori probabilities. The former states that the time and ensem- 
ble averages are equivalent, and the latter that all microstates which are 
consistent with macroparameters of the systems contribute to the ensemble 
with the same weight. Following these statements, the system, in particu- 
lar, should visit the region of the PES corresponding to r-th isomer with 
the probability 

Pr{E)=pr{E)/p{E) (6.4) 

where Pr{E) and p{E) are the density of states for r-th isomer and the 
total density of states, respectively. Therefore, we can count the relative 
residence time for a pair of isomers, r and q, in the conventional simulations 
and compare it with the corresponding relative density of states frq{E) = 
Pr{E)/Pq{E) = pr{E) / pq{E) obtained from the confinement simulations. 
The residence time is calculated as the number of quenches leading to the 
corresponding basin (it should not be confused with the life time, which 
was considered in Sect. 5; for a difference between them see Sect. 7.2). 
Figure 10c presents such a comparison for the excited-state isomers from 
the Figure 10b (for a reference isomer, q, the ground-state isomer is taken). 
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Data for isomer #138 is not presented because this isomer was visited in the 
conventional simulation so rarely that we were unable to gain satisfactory 
statistics for it. It is seen that the system in the conventional simulations 
behaves well in accordance with the previously mentioned statements, at 
least on interbasin scale. 

6.2 Rates of the transitions between catchment basins. Estimation of the rate 
of a complex transition by successive confinement [50, 52] 

For a model system in this case we consider 15-atom carbon cluster with 
atoms interacting through the semi-empirical Brenner potential [67] modi- 
fied in [68] . 

Let us study first the interconversion of two specific structures, Figure 11. 
For these structures we take the linear chain (C/min = —88.9509 eV) and the 
ring (C/min = —92.1344 eV), with the latter being the ground-state isomer for 
this cluster. To emphasize advantages of the confinement simulations (and, 
partly, to accelerate simulations), let the total energy be rather high, E = 
—58.95 eV, which corresponds to the temperature equal to approximately 
8800 K. 

At first, we confined the system to the basin for the linear chain and 
counted all attempts of the system to leave this basin for all other basins 
that surround the latter. As could be expected, the chain predominantly at- 
tempted to dissociate (the relative number of events pd = 0.877), 
Figure 11a. All other attempts were related to its conversion either into 
the ring (p = 0.016) or into other bound structures (a ring with one or two 
tails, a double ring, a bowl, etc., ph = 0.107 in total). The absolute value of 
the rate of the transitions from the linear chain to the ring, corresponding 
to the given value of p, was found equal to 5.89x10^° s“^. 

These values of p can serve a good illustrations to the efficiency of the 
confinement simulations: if the conventional approach were used, more than 
98 percent of the acts of the transitions would take the system out of the 
region of the PES that is of interest. 

Then the system was confined to the catchment area consisting of two 
basins, for the chain and ring, so that it could undergo free interconversion 
between these structures (Fig. 11b). The transition rate for the chain-to- 
ring conversion was obtained here as 5.92x10^° s“^, which is in excellent 
agreement with the previously given estimate obtained on the basis of count- 
ing the number of attempts to leave the chain structure basin (Fig. 11a). It 
is pertinent to note that though the ring is lower in energy than the chain, 
the rate of the backward process, i.e. of the ring-to-chain conversion, is 
considerably higher (1.34x10^^ s~^)- According to the detailed balance, 
this implies that the population of the chain structure basin (the residence 
time of the system in it) is higher than that for the ring structure, which. 
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bound structures 



(b) 




Fig. 11. Schematic picture of investigation of C 15 : (a) confinement of the sys- 
tem to the catchment basin for the chain structure (the arrow-like and bowl-like 
structures are just to indicate the classes of dissociated and bound structures, 
respectively); (b) confinement to the catchment area for the chain and ring struc- 
tures. 



in turn, can be explained by a higher value of the conformation entropy of 
the chain as compared to the ring. 

Each of the above approaches can be used for efficient estimating the 
rate of a complex transition, when the initial and terminal structures are 
separated by a set of intermediate structures. Specifically, the system can 
be successively confined to the catchment areas corresponding to two neigh- 
bouring structures (starting from the first pair), or more economically, to 
the catchment basins corresponding to the initial and all the intermediate 
structures. We have discussed this procedure for a model PES (Fig. 1) and 
found it much more efficient than the conventional simulations. 

Figure 12 illustrates this procedure of successive confinement for the cal- 
culation of the rate of the transition from the linear chain to a triple-ring 
structure (f/min = —85.3930 eV) for a specific channel of the transition. 
Starting from the linear chain, the system was then successively confined 
to the basins for the ring and the double-ring (f7mm = —87.4679 eV) struc- 
tures. In each of the basins the system was kept until it made at least 50 
attempts to pass into the basin to which it was supposed to be confined 
next. The arrows in Figure 12 show the probability for the system to leave 
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Fig. 12. Schematic picture of estimating the rate of the transition in the sequence 
of the Ci5 isomers. For details, see the text. 



the corresponding structure: either following a channel of interest (p), or 
dissociating (pd), or converting into the other bound structures (pb)- The 
rate of the first stage of the process (the chain-to-ring conversion), which 
corresponds to p = 0.016, is 5.92 x 10^° s“^. Then the rate of the transition 
from the linear structure to the triple-ring can be estimated by multiplying 
together the above rate and the corresponding values of p for two subse- 
quent forward transitions. It is obtained as 2.1 x 10® s“^. This process 
is obviously too slow to study it by conventional simulations. With the 
time step of numerical integration of Newtonian equations being equal to 

5.3 X 10“^® s, even a simple observation of an act of the transition from the 
linear chain to the triple ring would require the order of 10^® time steps. 
Correspondingly, to calculate the rate of the transition with satisfactory 
statistics, this value had to be increased by one or two orders of magnitude. 

6.3 Creating a subsystem of a complex system. Self-diffusion in the subsystem 
of permutational isomers [52, 63] 

It is common that “real” many-body systems are too complex for theoret- 
ical study, and the clusters is the case. The data obtained in numerical 
simulations (or experiments) represent some averages over a variety of the 
isomers. Because of this, comparison of analytical predictions with simula- 
tion or experimental data turns out to be a real challenge: on the one hand, 
theoretical consideration of a variety of geometrically different isomers is 
impracticable, and on the other, introducing some averages, to account for 
different characteristics of the isomers, may disguise possible defects of the 
theory. 

To this end, it may be useful to consider a subsystem, which, on the one 
hand, retains many of characteristic properties of the system, and on the 
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Fig. 13. (a) The ground-state isomer in LJ 13 , icosahedron, t/min = 
(b) The transition state configuration, dodecahedron, U — —41.5552. 



-44.3268. 




other hand, it is much easier for theoretical considerations. Specific isomers 
that we have considered are, in fact, simple examples of such subsystems. 
Another, a more sophisticated subsystem is the manifold of permutational 
isomers, which allows us to study some aspects of self-diffusion. Specifically 
we shall consider the ground-state isomer of the LJ 13 (Fig. 13a), whose 
permutational copies directly transform into each other through the saddles 
of the first order (transition states), Figure 13b. This transition is the 
rotation of six surface atoms with respect to the other six surface atoms. 
The PES of this subsystem consists of identical catchment basins directly 
connected through identical saddle barriers, and thus the subsystem ideally 
suits for theoretical considerations. 

In the simulations, the MD trajectory of the system was confined to the 
area of the PES corresponding to the permutational copies of the ground- 
state isomer, i.e. just the transitions among the catchment basins for these 
copies were allowed. The self-diffusion coefficient was estimated in the con- 
ventional way, as 

1 rl ” 

i=l 

within the time interval where the mean square displacement ((rj(t) — 
ri( 0 ))^) grew linearly with time (the angular brackets denote an ensem- 
ble average over a set of initial points (ri(0), ..., r„(0)), in which the MD 
trajectories started). 

Figure 14 compares the self-diffusion coefficient for this subsystem and 
that for the whole system, i.e. for the system freely exploring the PES (the 
conventional MD). It is seen that their behaviour in both cases is similar, 
so that such a simple subsystem can be used as a model system to study 
diffusion in the “real” system, which is incomparably more complex. A 
detailed the simulation study of this subsystem has been conducted in [63], 
with the results employed to verify a theoretical model for self-diffusion. 
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Fig. 14. Self-diffusion coefficients in LJ 13 : the triangles are for the subsystem of 
the permutational ground-state isomers, and the crosses are for the system freely 
exploring the PES. The total energy is counted from the ground-state isomer 
energy, Umin = -44.3268. 



The coincidence of the points at the lowest energy in Figure 14 reflects 
the fact that the transition state involved is the lowest saddle for LJ 13 . 
Therefore, at small energies the self-diffusion in the cluster starts as a per- 
mutational self-diffusion. 



7 Complex study of a system by successive confinement 

Originally meant for a detailed study of local regions of a PES, some exam- 
ples of which were given in the preceding section, the confinement technique 
was also found to be efficient for a complex study of a system, including its 
PES, equilibrium properties and kinetics [69]. 

For this purpose the MD trajectory of a system is successively confined 
to various catchment basins on the PES. To initiate the simulations, the 
system is placed into one of the basins, and the MD run is began. As before, 
the system is quenched at regular intervals in order to check if it is still in a 
given (current) basin or has left it for another basin. If left, then the system 
is either allowed to pass into a certain new basin or returned to the given 
basin. In the first case the trajectory is continued in the usual fashion, and 
in the second case it starts at the point of the last quenching within the given 
basin (Sect. 5.2). Since both the transition probabilities and thermodynamic 
functions can be calculated as averages of the corresponding quantities over 
the points associated with the given basin, this way of confinement of the 
system is completely consistent with determining its kinetics (Sect. 7.2) and 
equilibrium properties (Sect. 7.3). 
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7.1 Surveying a potential energy surface. Strategies 

7.1.1 Strategies to survey a surface 

The strategy of how to survey a PES is one of the most essential elements 
of the successive confinement approach. It prescribes to which basin the 
system should be confined next and for how long time. These basins are 
chosen from the basins that have been encountered (as the basins into which 
the system attempted to pass when it was confined to preceding basins) but 
have not been sampled yet. The choice of the basins depends on the goal 
of the study and may involve different criteria. Let us list some possible 
strategies. 

1 . If the system is small enough so that a complete sampling of the total 
PES is possible, we can simply allow the system to pass into each new basin 
that has not been previously sampled and study it in turn. 

2. If the system is large, the goal may be 

i) to survey just a part of the PES that is of interest, as for example, 
in Section 6.3, when the self-diffusion in the subsystem of permutational 
isomers was studied, or 

ii) to obtain a “coarse” survey of the total PES; in this case it seems 
reasonable to give preference to the basins for which their inherent structures 
differ most from those for the initial basin and basins which have been 
already sampled. 

3. If one is interested in the dynamics of the system, the preference may 
be given to basins with lower transition states. 

It is essential that in all above cases a prior information about the PES 
landscape is not required: being placed at any point on the PES, the system, 
basin by basin, can successively explore all the PES, or every connected part 
of it that is of interest. 

7.1.2 A taboo search strategy. Fermi- like distribution over the minima 

Let us consider in more detail the simplest strategy, which allows the sys- 
tem to pass into each new basin encountered that has not been previously 
sampled [70]. By the underlying concept, it is very close to so called taboo 
search [24,25], which has been mentioned in the Introduction. To realize 
this strategy, different protocols are possible. We will consider the following 
two. 

Protocol I. This corresponds to the direct taboo search in the terminol- 
ogy of [24, 25]. The system is kept in each of the basins until among the 
basins into which it attempts to pass a new basin appears that has not been 
sampled. To embody this protocol, a list of the basins is made and updated 
that indicates the basins that have been already sampled (in fact, this list 
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contains the characteristics of the corresponding quenched structures that 
are necessary for their identification and comparison, Sect. 4.2). 

Protocolll. This corresponds to a partial stochastic sampling [24,25] and 
assumes that the system is kept in each of the basins until it makes a certain 
number of attempts, fcatt> to pass into neighbouring basins. In this case two 
lists of the basins are made and updated: one indicates the basins that have 
been sampled, as in protocol I, and the other contains information about the 
basins that have been encountered but not sampled yet. Such information 
includes the characteristics of the minima of the basins (to recognize the 
basins) and atomic coordinates and velocities of the system at the points at 
which the system was found in the basins; these coordinates and velocities 
are employed to start the MD trajectories when (if) the system is allowed 
to pass into these basins. The basins to pass into are then taken from this 
second list of the basins. The simplest choice is to take them in the order 
in which they were encountered. However, the preference may be given 
to basins with any desirable property. One example of such choice, the 
preference to the basins with the lowest minimum energy, will be discussed 
later in Section 7.4. 

If no new minima are found while the system is confined to each of the 
basins encountered, the system is returned into the first basin of the second 
list of the basins, and the exploration of the PES is continued until the 
system visits every basin /cj-et times (a new trajectory is thus initiated in 
each of the basins fcattfcret times in total). If an isomer whose permutational 
copy has been already sampled is encountered, in both cases (protocols I and 
II) the transition into the basin corresponding to this isomer is forbidden 
(if, of course, the permutational copies are not of specific interest, as it was 
in Sect. 6.3). 

Optimal values of Tquench, katt and k^et may be dependent upon many 
factors: the nature and size of the system, the type of the dynamics 
(Newtonian or Langevin’s), the conditions at which the simulation is con- 
ducted (the value of total energy or temperature), and the goal of the study. 
Therefore, specification of these parameters, to a large degree, is the matter 
of trial and comparison. In other words, one should try different values of 
the parameters and compare obtained results for the convergence. Never- 
theless, certain recommendations can be given. For example, for an efficient 
surveying a PES, Tquench should evidently be of the order of the life time of 
the system in a basin. This is for both protocols I and II. As for kau and 
fcret in protocol II, the results of the search are dependent upon the product 
of these quantities rather than upon their individual values. Moreover, in 
a wide range of fcatt^ret, provided that none of the factors is unreasonably 
small, this dependence is rather weak. In the simulations presented below 
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Fig. 15. Caloric curve for LJ 13 , canonical ensemble. 



the following values of these parameters were typically used Tquench = 50 
time steps, fcatt = 20 and /cj-et = 20. 

To illustrate protocols I and II, consider surveying the total PES for 
LJ13. The Langevin MD will be used, as described in Section 5.2. In 
protocol II the basins to pass into are taken from the list of the basins to 
be sampled in the order in which these basins are encountered. Specifically, 
the simulations were conducted at three characteristic temperatures which 
are indicated by bars at the (canonical) caloric curve for LJ13 shown in 
Figure 15. They correspond to the solid-like state region (more definitely, 
to approximately the upper bound of this region, T = 0.25), to the melting 
region {T = 0.3) and to the liquid-like state region {T = 0.35). All values 
in this section, here and below, are given in the LJ units. 

Note that in this case the cluster executes the overall translations and 
rotation. Therefore, for the quantitative comparison of this curve with 
the microcanonical caloric curve of Figure 10a the mean total energy in 
Figure 15 should be reduced by the value of 3 /cbT corresponding to the 
translational and rotational degrees of freedom. 

The total number of geometrically different minima found for L J 13 grew 
every work: Hoare and Mclnnes [71], employing a growth algorithm to 
seek for candidate structures, found 988 minima. Later Tsai and Jordan 
[20], who used a direct search by walking on the PES with the help of 
the eigenmode method, increased this number up to 1328. Finally, just 
recently Doye et al. [47] and Ball and Berry [48], who also employed a 
direct search but a different modification of the eigenmode method, found 
1467 and 1478 minima, respectively. The search for the total number of 
minima is complicated by the fact that there is no criterion to decide when 
the exploration of the PES can be regarded to be complete. Also note 
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Fig. 16. Number of minima located with time in the confinement simulations: 
protocol I (solid line) and protocol II (dashed line). LJ 13 , canonical ensemble, 
T = 0.35. 



that the search for specific minimum is a sort of the NP-hard problem (in 
application to clusters, see, e.g. [72]). 

Figure 16 presents a typical dependence of the number of minima located 
with time under protocols I and II. The temperature is for the liquid-like 
state (T = 0.35). In both cases the system was initially placed into the 
basin corresponding to the ground-state isomer (Fig. 13a). Though k^et 
(protocol II) did not reach its maximum value (fc^et = 20) for each of the 
basins, both runs were terminated at 5 x 10^ time steps, where in the case of 
protocol II no new minima were found for approximately 1x10^ last steps. 
The total number of minima found under protocols I and II is 1504 and 
1506, respectively. In both cases it slightly exceeds the maximum number 
of minima reported in previous works. 

As can be seen from the figure, in the initial stage of the search, when the 
list of sampled minima is almost empty, the number of minima in the search 
under protocol I grows a bit faster than that under protocol II. However, 
in the final stage, when that list is almost complete, the situation changes: 
in the case of protocol I the system has to dwell in a current basin for a 
long time until a basin which has not been sampled yet is encountered. 
Moreover, it may happen that the minima which were missed in the initial 
and intermediate stages of the search are not accessible from the basins in 
which the system dwells in the final stage. Therefore, protocol II seems to 
be more preferable for surveying a PES than protocol I. 

Figure 17 shows typical results of surveying the PES for the temper- 
atures indicated in Figure 15. Three upper (thick) curves correspond to 
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Fig. 17. Number of minima located with time in the confinement simulations 
under protocol II (thick curves) and in the conventional simulations (thin curves): 
the long-dash lines correspond to T = 0.25, the short-dash lines to T = 0.3, and 
solid lines to T = 0.35. LJ 13 , canonical ensemble. 



the successive confinement taboo search (SCTS) under protocol II, and the 
other (thin) curves to the direct search by the SW method (SWDS); in both 
cases the quenching interval was the same, 50 time steps. First, as it could 
be expected, the SCTS is faster than the SWDS. Second, and somewhat 
surprising, the SCTS results are just slightly dependent upon the temper- 
ature. By the time when the last of 1506 minima was located at T = 0.35 
(t = 4 X 10^ time steps), there were found 1502 and 1499 minima at T = 0.3 
and T = 0.25, respectively, with total number of minima in both latter cases 
being 1502. This is particularly surprising in comparison with the SWDS 
simulations, where the decrease of the temperature from the liquid-like state 
{T = 0.35) to the solid-like one (T = 0.25) results in a radical reduction of 
the number of minima. 

The energy spectrum of the minima found in these simulations is shown 
in Figure 18. 

It should be noted that the conventional (SWDS) simulations presented 
in Figure 17 are not “pure” conventional: if an act of dissociation (evapo- 
ration) of the cluster was indicated, the MD trajectory was continued as in 
the confinement simulations, that is a new trajectory was initiated at the 
point where the system was found in bound state. Otherwise, at higher 
temperatures the trajectories have to be terminated in a short time: at 
approximately 6 x 10"^ and 3 x 10^ time steps for T = 0.3 and T = 0.35, 
respectively. 




S.F. Chekmarev: Confinement Technique 



547 




0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 9.0 10.0 
ENERGIES OF MINIMA 



Fig. 18. Energy spectrum of minima for LJ 13 . The energy is counted from the 
ground-state isomer energy, C/min = —44.3628. 



The difference between the SWDS and SCTS simulations is well illus- 
trated by the distributions of the number of quenches over the minima 
to which these quenches lead. The SWDS distribution, by its nature, is a 
steady distribution; it depends on the elapsed time only in that the statistics 
becomes richer with time. In contrast, the SCTS distribution is unsteady, 
its front propagates with time. 

The SWDS distribution originates from the Boltzmann distribution. For 
the PES divided into catchment basins, as in Section 6.1, the total partition 
function of the cluster of n atoms is written [3] as 

Z = ^ exp(-C/n,i„,,/fcBT)Z,(T) (7.1) 

i 

where 

dr<=»)dp<- (7.2) 

is the partition function for f-th basin. Here hi is the order of the point 
group, and Hi = Pj/2m -|- Ui is the Hamiltonian of the cluster asso- 

ciated with i-th basin (the potential energy Ui is counted from the basin 
minimum energy [/niin,i)- In the given representation, Z and Zi have a 
meaning of the total numbers of states and the number of states associated 
with z-th basin, respectively. According to (7.1, 7.2), the mean number of 
the quenches which lead to the minimum of z-th basin can be estimated as 

{N,) = Wot exp{-U^i^^JkBT)Z,lZ (7.3) 

where Wot is the total number of quenches, and the angular brackets denote 
the ensemble average. 
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Fig. 19. Distribution of the mean number of quenches over the minima in the 
conventional simulations. Triangles correspond to the simulation data, and crosses 
to equation (7.3), with Zi estimated in the harmonic approximation (7.4). LJ 13 , 
canonical ensemble, T = 0.35. 



Figure 19 compares the SWDS distribution with equation (7.3), in which 
Zi are estimated according to the harmonic approximation [73] 



UT) 



knT \ {2kBTf/H7TlulM^/^ 

kuj, ) hih^ 



(7.4) 



where Ui and lai (a = 1,2,3) are, respectively, the geometrical mean of 
normal frequencies and the principal momenta of inertia for the inherent 
structure corresponding to the minimum of z-th basin; these characteris- 
tics, and also hi, were determined in the course of simulations. The first 
and second multipliers in the right-hand side of (7.4) correspond to the vi- 
brational and rotational degrees of freedom, respectively; a contribution of 
the translational degrees of freedom is omitted, since it is the same for all 
isomers. A discrepancy between the simulation data and the estimate in 
Figure 19 should be attributed to anharmonic effects, because in the liquid- 
like region, to which the value of T = 0.35 corresponds, the anharmonicity 
is appreciable. 

The SCTS distribution is essentially different from the SWDS distribu- 
tion: while the intrabasin statistics remains to be the Boltzmann statistics, 
the interbasin statistics, in the part of visiting the basins that have been 
visited, is inherently the Fermi statistics. To find mean occupations num- 
bers for the basins in this case, we can consider the collection of the isomers 
as an ideal gas, with z-th isomer contributing to the partition function as 
[Zi(T) exp(— [/min,i/feT)]^% where Ni takes the values 0 and 1. Then, 
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following the standard procedure (see, e.g. [74]), we obtain [70] 

where /i, playing a role of chemical potential, is determined by the condition 
J2i i^FL,i) = {Ntot)- In the case of small values of the mean occupation 
numbers (the second term in the denominator is much larger relative to 
unity), which corresponds to the Boltzmann statistics, this equation trans- 
forms into (7.3). The appearance of such a distribution in classical simu- 
lations may seem somewhat unusual. However, in fact, there is no a great 
surprise in this, since the quantum character of the Fermi statistics in its 
conventional applications is due to the Pauli exclusion principle and not to 
the statistics itself. 

Equation (7.5) describes the equilibrium distribution. Therefore it is 
applicable to the SCTS distributions if the latter are of quasi-equilibrium 
character. Further, the system under consideration, LJ13, possesses a finite 
number of the states (minima), therefore equation (7.5) is correct until the 
front of the distribution reaches the upper bound of the energy spectrum. 
Fortunately, the quasi-equilibrium character of the SCTS distributions holds 
well practically for all elapsed times, until the front of the distribution comes 
into the upper bound of the energy spectrum (see Fig. 21 given below). 
Also note that though equation (7.5) is appropriate to protocol I rather 
than to protocol II, at the elapsed times, when the time required to execute 
fcatt attempts to leave a current basin is negligible in comparison with the 
elapsed time, this equation can be applied to protocol II too. 

Figure 20 compares the SCTS distribution for t = 8 x time steps 
with equation (7.5), where Zi are calculated in the harmonic approximation 
(7.4), as in Figure 19. The distribution does not include the repeat quenches 
leading to the sampled minima. Correspondingly, /i in (7.5) was determined 
from the condition that (Atot) is equal to the total number of quenches 
included into the distribution. 

The front of the SCTS distribution proceeds with time as Figure 21 
shows. For an illustrative purpose, to avoid an overlapp of the data for 
different elapsed times, the distributions are given in a smoothed form: 
each of the points represent an average number of quenches falling into one 
of 25 boxes into which the energy range was uniformly divided. All curves, 
except for curve 6, even curve 1, are of quasi-equilibrium pattern, that is 
the corresponding distributions are described by (7.5), approximately with 
the same accuracy as in Figure 20. Curve 6 clearly demonstrates the effect 
of the finite number of the states (basins). 

For each current elapsed time t the sum of (A^) over minima, (A(t)) = 
^j(Ai(t)), presents the number of minima which probably will be found to 
this time in an individual run, as in Figure 17. 
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ENERGIES OF MINIMA 

Fig. 20. Distribution of the mean number of quenches over the minima in the 
confinement simulations (protocol I) . Triangles correspond to the simulation data, 
and crosses to equation (7.5) with Zi calculated in the harmonic approximation 
(7.4). LJi 3, canonical ensemble, T = 0.35. 
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Fig. 21. Evolution of the SCTS distributions (protocol I) with time. Triangles 
indicate the simulation data, and the dashed lines are to guard the eye. Labels 
1,2, 3,4, 5 and 6 correspond to the elapsed times 1.25 x 10®, 5 x 10®, 2 x 10^, 
8 X 10^, 3.2 X 10® and 1.28 x 10® time steps, respectively. LJ13, canonical ensemble, 
T = 0.35. 



Unfortunately, such a relatively simple picture of surveying the PES, 
leading to a propagating Fermi-like distribution, is characteristic only of 
systems whose PESs consist of a single funnel, as for LJ13 and some other 
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clusters {e.g. LJig and LJ 55 [47]). For multifunnel PESs, similar to those 
shown in Figure 1 and will be shown later in Figures 23 and 25, this picture 
is destroyed because of the overlap of the energy spectra in different funnels. 
However, the employed strategy, by itself, turns out to be quite efficient for 
surveying such PESs (Sect. 7.4). 



7.2 Kinetics 



To study kinetics, a master equation is usually used. It is a loss-gain equa- 
tion and describes the time evolution of the probability Pi(t) for finding the 
system in a state i 

^ - W,.PM (7.6) 

i=i 



where N is the total number of states, and Wij is the probability of the 
transition from state j to i per unit time. In the matrix notations, (7.6) 
acquires the form 



dP 

dt 



= WP 



(7.7) 



where P(t) = {Pi{t)} is the vector of probabilities, and W = {Wij — 
^ji} (bJ = 1) ■■■jN) is the transition probability matrix. The equa- 
tion is linear and completely determines P(t) t > t' by P(P) (i.e. it 
implies that the process to be considered is a Markov process). 

Being interested in basin-to-basin kinetics, we shall consider a catchment 
basin for a state. Then Pi (t) will present the probability to find the system in 
basin i, and Wij can be identified with the rate of the transition from basin j 
to i. Wij depend on the conditions under consideration, in particular, on the 
total energy of the system and on the temperature for the microcanonical 
and canonical ensemble, respectively. 

One difficulty in application of the master equation to a specific prob- 
lem is the specification of the transition rates. For this purpose the transi- 
tion state theory is usually employed in its simplest (harmonic approxima- 
tion) version. Specifically, according to the Rice-Ramsperger-Kassel-Marcus 
(RRKM) theory [75], the rate constants for a particular channel of transi- 
tion {1) from basin i to j are defined by the expressions: for microcanonical 
ensemble 




Wm ( E - AU* 
V E 



(7.8) 



and for canonical ensemble 






n 

n 



N 

m—1 

N-1 

m=l 






exp{-AU^ /ksT). 



(7.9) 
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Here N is the number of vibrational degrees of freedom (= 3n — 6 for non- 
linear atomic configurations), and are, respectively, the frequency 
of TO-th normal mode at the minimum of the basin i and at Cth transition 
state between basins i and j, and AC/;* is the height of the barrier corre- 
sponding to the transition state I, which is counted from the bottom of basin 
i (generally speaking, the right-hand sides of (7.8) and (7.9) should also be 
multiplied by the factor hi / h* to account for a possible difference between 
the minimum and the transition state in symmetry). The transition rate 
Wji, entering equation (7.7), is then calculated as the sum of over all 

channels of the transitions from basin i to j, Wji = J2i ^fi- 

In certain cases (7.8) and (7.9) give reasonably good estimates for kji 
(see, e.g. [39] and the discussion therein). Nevertheless, one can hardly 
be a priori sure that these formulae will work well in every specific case, 
particular if the basins are shallow or, that is the same, if the temperature 
is high. In this cases anharmonic effects should be taken in account, which 
complicates the problem to a large degree. 

Confinement simulations allow a direct calculation of the transition 
rates. While the system is confined to a given basin i, one can calculate the 
transition probabilities, Qji, that the system will be found in a certain basin 
j at the subsequent quenching corresponding to the time interval, Tquench- 
Specifically, Qji are calculated as 

Qji — ^ ^ (^’^^) 

i 

where riji is the number of times that the system is found in basin j at the 
subsequent quenching when at the previous quenching it was in basin i. 

It should be noted that j may be equal to i in (7.10). This is presented by 
diagonal components of the matrix, Qu, and corresponds to the possibility 
that the system did not leave basin i, or that it left this basin for a time 
less than Tquench and then returned to it. Therefore, if the probability of 
system’s re-entering is negligible, Qu bears information about the average 
value of the life time of the system in z-th basin, riife,i. Specifically, if the 
life times follow the Poisson distribution 

Qii = exp( Tquench/Tlife,z). ("^-H) 

Successively confining the system to various basins on the PES, we can ob- 
tain the transition probability matrix Q = {Qij} at the conditions of the 
simulation: at constant total energy of the system (microcanonical ensem- 
ble) or at constant temperature (canonical ensemble). This matrix is related 
to the transition rate matrix W as 



Q — ^Xp(WTquench) ■ 



(7.12) 
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Therefore, with the transition probabilities obtained by successive confine- 
ment, we can calculate transition rates. For this, for example, taking the 
quenching interval Tquench to be sufficiently small, the right-hand side of 
equation (7.12) can be expanded in a power series, and keeping the linear 
term yields 

W = (Q-I)/rque„ch. (7.13) 

However, kinetics can be described directly in the terms of the transition 
probabilities, if we treat the transitions between the basins as a Markov 
process in the discrete time domain of quenching intervals. In the framework 
of this formalism, the transition probability matrix Q after n successive 

'^quench StepS is 

Q(?rTquench) — Q (’^quench) ("7-14) 

and the time evolution of the vector of probabilities obeys the equation 

P(t) = Q(nrquench)P(0) (7.15) 

where t = nrquench) and P(0) is the initial vector of probabilities. 

7.3 Equilibrium properties 

In a stationary state, the vector of probabilities P (= P®*) satisfies the 
equation 

pst ^ Qpst 

and, if the transition probability matrix Q is irreducible {i.e. there exists a 
path from one basin to any other) , this vector corresponds to the equilibrium 
distribution of residence probabilities, i.e. P®'^ = P®*. P®'^ can be calculated 
in two ways. One way is to solve (7.16) directly, and the other is to evaluate 
the matrix Q(nTquench) (7.14) at n ^ oo and then to substitute the result 
in (7.15) in order to obtain P®'^. 

From the point of view of statistical mechanics, present relative 
numbers of states associated with certain basins. Specifically, for micro- 
canonical ensemble, in accordance with (6.4) 

PP{E)=p,{E)/p{E) (7.17) 

and for canonical ensemble, according to (7.1, 7.2) 

PP{T) = Z,{T)IZ{T) (7.18) 

with the factor exp(— [/min^i/ZcBT) being included into Zi{T). With P^^{E) 
[P®'^(P)] having been calculated in the course of successive confinement, 
these equations give us Pi{E) [Zi{T)] with an accuracy of a mutual factor 
depending on E [T]. According to the formulae of statistical mechanics. 
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Phc. 1: 

Fig. 22. The tetrapeptide N — acetyl — [Ala)z — methylamide (TP). The dihedral 
angles (j> and ^l> (soft torsions) are on each side of the C°‘ carbons, and u on each 
amide bond. 



Pi{E) determines the absolute value of the mean kinetic energy (Akin) as 
a function of E {i.e. the microcanonical caloric curve for f-th structure), 
and Zi{T) the mean total energy (A) as a function of T (the canonical 
caloric curve for this structure). Other characteristics, such as entropy and 
free energy, can be calculated from A®‘^(A) and A®‘^(T) just as relative 
values with respect to certain reference functions depending on A and T, 
respectively. To obtain the absolute values of these characteristics, it is 
sufficient to calculate the absolute value of Pi{E) [Zi{T)] for one of the 
structures, which makes it possible to find p{E) [Z(T)] from (7.17) [(7.18)]. 
An example of such calculations was given in Section 6.1. 

7.4 Study of the alanine tetrapeptide 

This method of successive confinement was applied to a complex study of the 
molecule of alanine tetrapeptide (TP), N — acetyl — {Ala)s — methylamide, 
in solvent [69] . This molecule is one of the shortest peptides that can form 
a full helical turn and thus is of considerable interest as a model for the 
formation of secondary structures [33,35,38]. This molecule is complex 
enough to have a multifunnel PES, and at the same time it is simple enough 
to allow a direct simulation that made possible to compare the confinement 
and conventional simulations. The covalent structure of the TP is shown in 
Figure 22. 

To conduct simulations, there was used the CHARMM program [76] with 
the ACS implicit representation of the solvent [77] and the polar hydrogen 
parameters (paramlQ) [78]. The Langevin MD was employed, the friction 
coefficient being equal to 64 ps“^ and the time step to 1 fs. For quench- 
ing, the steepest descent method was combined with the adopted-basis set 
Newton-Raphson minimization [76]. To find heights of saddle barriers be- 
tween neighboring minima, the TRAVEL algorithm [42] was employed. 

To study the system, we applied the simplest strategy of successive con- 
finement that was described in Section 7.1.2, specifically protocol II. Despite 
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its simplicity, this strategy was expected to provide an efficient sampling of 
PES of a multifunnel pattern that was anticipated for the TP from previous 
studies [35,38]. Indeed, turn to Figure 1. Let the system be at the bot- 
tom of one of the funnels and allow it to pass just into the basins not been 
previously sampled. Then, since the number of the basins at the bottom is 
finite, new basins in this region inevitably come to end, and the system will 
become explore basins on a higher level of the funnel. Thus, basin by basin, 
the system will climb the surface until it reaches an interfunnel barrier, then 
it will overcome this barrier and pass into the corresponding neighboring 
funnel. At reasonable temperatures (as assumed), the flow of representative 
points of the system at the upper part of any funnel is directed towards the 
bottom. Therefore, when the system is confined to a basin at the upper 
part of the funnel into which it has passed, the basins that lie lower than 
the current basin will appear with a higher probability than the others. As 
a result, the system, basin by basin, will descent towards the bottom of this 
funnel and then, as low-lying basins come to end, it will climb the surface 
again. In this way, funnel by funnel, the system can explore the total PES. 

Surveying the PES was performed first at T = 500 K, with the total 
length of the run of 5 x 10^ time steps. The basins were sampled in the 
order in which they were encountered when the system was confined to 
preceding basins. For this time, 408 minima and 4800 transitions states 
were found. 

To organize information about the explored fragment of the PES, we 
use the disconnectivity graph representation introduced in [38], Figure 23. 
This graph shows which minima are connected by pathways lying below 
a certain energy threshold. All minima are grouped into funnels depend- 
ing of the height of the barriers between them: the minima separated by 
relatively low barriers are considered as connected and thus related to the 
same funnel, and those separated by high barriers are considered as discon- 
nected and thus related to different funnels. The graph reveals five clearly 
defined funnels: the central one, which contains the a-helix and /3-strand 
conformers (Fig. 24), and four side funnels, that are separated from it by the 
barriers of the order of 10 Kcal/mol. The consideration of dihedral angles 
of the conformations associated with the minima has showed that all con- 
formations in each of the funnels are characterized by a certain set of values 
of oji, and these sets vary from one funnel to another. With the values of 
to = 180° and w = 0° corresponding, respectively, to the trans and cis form 
of a peptide group, we found that the central funnel is associated with all 
trans conformation of the TP, whereas the others represent mixed trans/ cis 
conformations. The successive confinement thus made it possible to cross 
the energy barriers which are as high as for the trans-to-cis transitions in 
peptides. 
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Fig. 23. Disconnectivity graph for the sampled fragment of the PES of the TP. 
Energy level spacing is 1 kcal/mol. a and f3 indicate the minima corresponding to 
the a-helix and /3-strand conformers, respectively. Stars indicate the minima that 
were found at the direct MD exploration of the PES. Figures label the funnels 
according to the rule given in the text. 



To compare the confinement and conventional approaches, the conven- 
tional Langevin MD simulations were also conducted. At the same total 
length of the run (5 x 10^ time steps) only 97 minima and 670 transition 
states were found, all located in the lower part of the central funnel, where 
the MD trajectory was started. These minima are indicated in Figure 23 
by stars. 
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Fig. 24. Stereo view of (a) a-helix (t/min = —127.5923 Kcal/mol) and (b) /3-strand 
(f/min = —126.8182 Kcal/mol) conformers. Prepared with help of the MOLMOL 
program [79]. 



To label funnels in Figure 23 (and in Fig. 25 below), binary digits are 
used. Namely, with 0 standing for the trans form of the peptide group and 1 
for the CIS form, the label of a certain conformation of the molecule is taken 
as the value of Zii 2 * 3 * 4 , where ik is equal either to 0 or 1 (A: = 1, 2, 3, 4), and 
the count is made from left to right. For example, label 0 means that all 
groups are in the trans form, 2 (= 0100) that the second group is in the cis 
form and the others are in the trans form, and 6 (= 0110) that the second 
and third groups are in the cis form and the other two are in the trans form. 

The next step was to find all 16 funnels corresponding to different 2^ 
combinations of the trans and cis elements. For this purpose a more “course” 
survey of the PES was employed in the framework of protocol II: the pref- 
erence was given to the basins with the lowest values of C/min- In this case 
once such a basin occurred to be in a different funnel, the system stopped 
further exploration of the given funnel and moved to this new funnel. Be- 
sides, to accelerate calculations, the temperature was elevated to T = 800 K. 
Figure 25 presents the disconnectivity graph built on the data obtained for 
6.4 X 10® time steps. 

Kinetics was studied at the normal (room temperature) conditions, T = 
300 K. For the total length of the run of 3.6 x 10^ time steps, 69 minima and 
289 transition states were found. In each of the basins the system made 150 
attempts to pass into other basins. The conventional simulation, to locate 
approximately the same number of minima and transition states (53 and 
217, respectively), took a time 3 times longer. 

Table 2 compares the transition probabilities for the transitions from 
the /3-strand conformer. There are shown channels for which statistics in 
the case of the conventional simulations could be considered as satisfactory 
(not less than 100 events). The data are in reasonable agreement. Note that 
the absolute numbers of events in the confinement simulations are less than 
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Fig. 25. Disconnectivity graph for the total PES of the TP. The notations are as 
in Figure 23. 



those in the conventional simulations approximately by factor of 50, whereas 
the length of the run was only 3 times shorter. This is a consequence of 
more homogeneous distribution of the events over basins in the confinement 
simulations as well as a reflection of the fact that the /3-strand conformer is 
thermodynamically most stable among the others (see below). 

Table 3 compares the equilibrium residence probabilities for a set of 
low-lying conformers that were calculated directly in the conventional sim- 
ulations and indirectly, from Eq. (7.16), in the confinement simulations. 
In general, the data are also in satisfactory agreement, approximately to 
within the statistical error (remind that in the confinement simulations the 
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Table 2. Transition probabilities, Q, and absolute numbers of the events, n, to 
escape from the basin for the /3-strand conformer. SW and SC label the results 
of the conventional and confinement simulations, respectively, t/min indicates the 
minimum energy (in Kcal/mol) of the conformer into which the system passes 
from /3-strand conformer. 



^min 


nSC 


nSW 






(/3) -126.8182 


961 


44792 


.861883 


.869849 


-125.8585 


65 


2874 


.058296 


.055812 


-125.8013 


73 


3016 


.065471 


.058570 


-124.8512 


7 


302 


.006278 


.005865 


-126.0183 


4 


223 


.003587 


.004331 


-125.9602 


4 


197 


.003587 


.003826 



Table 3. Equilibrium residence probabilities; correspond to the conventional 
MD simulations, and to equation (7.16) with the transition probabilities 
found in the confinement simulations (some of them are given in Tab. 2). f/min is 
the conformer minimum energy, a and /3 label the Q-helix and /3-strand conformer, 
respectively, n®'^ is the absolute numbers of the acts of the transitions into (from) 
a given basin that were observed in the conventional simulations. Energies are 
given in Kcal/mol. 

nSw psc psw 



(a) -127.5923 


20 


.00890 


.00654 


-125.0197 


77 


.00143 


.00158 


-124.8416 


16 


.00029 


.00018 


(/3) -126.8182 


6702 


.54508 


.54080 


-125.8585 


3098 


.04165 


.04052 


-125.8013 


3432 


.05840 


.05509 


-124.8512 


526 


.00567 


.00605 


-126.0183 


832 


.09673 


.08513 


-125.9602 


285 


.11417 


.11482 


-125.0428 


519 


.00848 


.00663 


-125.9658 


633 


.07392 


.08243 



number of the acts of the transitions from each of the conformers was equal 
to 150). 

As one can see from Table 3, though the a-helix conformer is lower 
in energy than the /3-strand one, the residence probability (or, that is the 
same, the relative residence time) for the latter is considerably larger. It is 
exactly the situation which we had in Section 6.2, when the chain structure 
for Ci 5 was populated higher than the ring structure. Equation (7.18) can 
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be rewritten as 

PP{T) = Z,{T)IZ{T) = exp{-F,/kBT)/Z 

where Fi = Fi(T) is the free energy of i-th conformer. Therefore, the 
residence time for a certain structure is determined by its free energy F = 
E — TS and not by energy E. The conformation entropy for the /3-strand 
conformer is evidently higher than that for the a-helix, therefore /3-strand 
turns out to be lower in free energy and, as a results, higher populated than 
the a-helix. 

8 Concluding remarks 

Going beyond the limitations imposed by the inherent dynamics and statis- 
tics, which restrict the capabilities of the existing simulation methods, the 
confinement technique offers a wide flexibility in the study of a many-body 
system, including the exploration of its PES and the calculation of the ther- 
modynamic properties and kinetics. Depending on the goal of the study, 
one can investigate either the total PES or a local part of it that is related to 
the structures and/or channels of transitions of interest. It is essential that 
information about the PES, thermodynamic properties and kinetics can be 
obtained simultaneously, i.e. in the course of a single molecular dynamics 
run. 

The approach is still in its infancy. One of the problems, which persists 
for the systems of large size, concerns the identification of inherent struc- 
tures. Generally speaking, every method which employs mapping of the MD 
or MG trajectory onto the minima on the PES encounters this problem. In 
this work, to identify the structures, we involved quenching. It is the most 
general approach, being applicable to atomic and molecular systems of dif- 
ferent nature, but at the same time, it is the most time consuming element 
in the simulations. Every quench usually takes the order of 10^ time steps. 
Therefore, if we, say, quench the system every 100 time step of numerical in- 
tegration of the equations of motion, the time is mostly spent for quenching. 
This is evidently unacceptable in the case of the system consisting of hun- 
dreds or thousands of atoms. Therefore, devising and development of time 
saving methods of identification of the structures appears to be a central 
problem for a future application of the confinement technique to systems of 
larger size. One method, suitable for the systems with well defined bond 
lengths, is the adjacency matrix method [80], in which the bonds between 
atoms are analyzed (broken or not) rather than the positions of the atoms. 

Let us also mention some directions of further development of the con- 
finement simulations. 

For an elementary cell to which the MD trajectory was confined, we 
considered a catchment basin. However, in principle, any suitably defined 
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region of configuration (or phase) space can be used for this purpose. For 
example, for biomolecules, the configuration (conformation) space can be 
separated into confinement regions by the values of dihedral angles or by the 
number of native contacts. The confinement simulations then can be used 
for a guided construction of so called statistical energy surfaces, which are 
defined in terms of energy, conformation entropy and a progress variable 
(with the number of native contacts usually serving for the latter). The 
knowledge of such surfaces is essential in the context of protein folding 
problem [33,34]. 

The potential energy and interatomic forces in the confinement simula- 
tions need not necessarily be given by a means of empirical or semi-empirical 
potential. They can also be calculated with the use of quantum mechanical 
treatment of electrons. Correspondingly, the ab initio molecular dynamics 
(AIMD) can be employed. Following Car and Parrinelo [81], various AIMD 
schemes have been developed that can be applied (and partly have been 
applied) to simulation studies of cluster dynamics (see, in particular, [82] 
and the papers cited therein). 

Though we considered finite systems, the approach is equally applicable 
to the study of massive blocks of material, in which case the system is 
subject to periodic boundary conditions (note that the SW approach was 
first applied to such systems [83]). 

Finally note that in many imaginable scenarios the confinement simu- 
lations are consistent with parallel-processing computations. First of all, 
this concerns the study of kinetic and equilibrium properties, where good 
statistics is required. 

I am grateful to the organizers of the School, Professor C. Guet and Professor P. Hobza, 
for the invitation to present this material to graduate and post-graduate students. Some 
results are new, in particular those obtained in recent work with Professor M. Karplus 
and Dr. S. Krivov [69], whose collaboration is gratefully acknowledged. I would like also 
to thank S. Krivov for fruitful discussions in the course of preparation of the manuscript. 
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1 Introduction 

Modern chemistry is based on the understanding of the chemical bond. The 
chemical bond implies the distribution and delocalization of electrons over 
the entire molecule resulting in a strong, i.e. covalent interaction. Mod- 
ern theoretical ab initio quantum chemistry methods have been extremely 
successful in describing the electronic structure of isolated molecules to a 
degree of precision that in some cases comes very close to high-resolution 
spectroscopic results. From the experimental viewpoint, there is quite a 
variety of spectroscopic methods to look at a molecule in the gas phase, i.e. 
under conditions where it can be considered unperturbed from collisions or 
other external forces, for instance, in molecular beams from supersonic jet 
expansions. 

Molecules can interact in two ways - reactively (leading to the formation 
or breaking of a covalent bond) or nonreactively (leading to the formation 
of a molecular complex). The former is clearly a covalent interaction; the 
later is termed non-covalent, physical or van der Waals (vdW) interaction. 
Such interaction leads to formation of molecular complexes called also vdW 
molecules. In these systems, the properties of the original subsystems are re- 
tained to a considerable degree. VdW molecules are considerably weaker [1] 
than “classical” molecules and a bonding of 15 kcal/mol and bond length of 

2 A can be very roughly considered as the boundary separating “classical” 
molecules from vdW molecules. 

1.1 The hierarchy of interactions between elementary particles, atoms and 
molecules 

Before considering vdW interactions more closely, an attempt will be made 
to consider them in the context of other types of interactions in the mi- 
crocosmos. Interactions between atomic nuclei and electrons lead to the 
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formation of atoms. Atoms form molecules. Both these interactions are 
much stronger than interactions between molecules leading either to for- 
mation of a new moelcule or to formation of a molecular cluster. In the 
former case a covalent bond is formed and broken while in the latter case 
no covalent bond is broken or formed. Stabilization energies of a molecular 
cluster are much smaller than other stabilization energies. 

1.2 The origin and phenomenological description of vdW interactions 

What is the source of the attractive effect [1] between molecules? Consider 
the overlap of the electron clouds in covalent and non-covalent interactions. 
If a covalent bond is formed between the subsystems when the electron 
spheres of the subsystems interact, it is then clear that the attraction must 
result from this overlap. As overlap is unnecessary in vdW interactions, 
the reason for attraction must be sought elsewhere. The source of the 
attraction in non-covalent interactions can lie only in the electrical and 
magnetic properties of subsystems. 

Electric charge in molecules is distributed very unevenly and represents 
what is termed an electric multipole; this may be dipole, quadrupole, oc- 
tapole, etc. Interactions between electric multipoles, termed electrostatic 
interactions can be attractive or repulsive and is given by the fol- 

lowing equation: 

= qRQT^ + qRPTr~'^ H f pri1tv~^ H h + ■ • ■ (IT) 

where g, p and Q designate charge, dipole and quadrupole moments, respec- 
tively of subsystems R and T {p’s are vectors and Q’s are tensors). The 
magnitude of the contributions in expansion (1) usually decreases sharply 
when passing to higher multipoles. 

Electric multipoles form an electric field in its surroundings. This field 
induces an electric multipole in other system in the vicinity of the original 
multipole. The ability of the system to be polarized is described by im- 
portant characteristics of the system, called the polarizability. There is an 
interaction between the original and the induced multipole; this is always 
attractive (in contrast to interactions between permanent mulptipoles) and 
is termed induction or polarization interaction. The multipole expansion of 
the induction energy {E^) is given by equation (2), 

= — 0.5a/{ -I- Px{3 cos^ 9t + l)r“® -I- . . . ] 

- O.Sot + m|:(3 cos^ 9r + l)r“® -I- . . . ] . (1.2) 

In equation (2) a, q, and p designate polarizability, charge and dipole mo- 
ments, respectively, of system R or T and 9t is the angle formed by the axis 
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of system T and the line connecting the centers of gravity of the two systems. 
Both above-mentioned components of the interaction energy, electrostatic 
and induction, can be described in terms of classical physics and both were 
defined at the beginning of this century. Experience, e.g. the possibility of 
liquefying rare gases, indicates that still another attractive force must exist, 
acting between uncharged, unpolar systems. London [2] gave a theoretical 
interpretation to these forces on the basis of quantum mechanics in 1930. 
As the same oscillator strengths appear in the expression for this force as 
in the expression for the dispersion of light, the energy corresponding to 
this interaction is termed the dispersion energy (E^). Similar to induc- 
tion energy, dispersion energy is always attractive. It is apparent from the 
multipole expansion of the dispersion energy 

= -Cgr-® - Cgr-s - Cwr~^° (1.3) 

that the terms of the expansion are proportional to the sixth, eight and 
tenth power of the reciprocal distance. 

It is apparent from equations (1-3) that electrostatic interactions act 
over the longest range, followed by induction and dispersion interactions. 
It would thus seem that dispersion interactions are not as important as the 
first two types; however the contrary is true. Dispersion interactions, e.g. 
between biomacromolecules, represent a very important component of the 
total interaction energy and are usually more important than the induction 
energy. 

Physical interpretation of the dispersion interactions is neither simple 
nor unambiguous; the origin of these interactions can be described quali- 
tatively as follows: the oscillation of atomic nuclei and electrons leads to 
the formation of a time-variable dipole (in general, multipole) even in non- 
polar, spherically symmetric systems. The value of this dipole depends on 
the instantaneous positions of the nuclei and electrons, varies in time and 
equals zero when averaged over time. At the instant when the time- vari- 
able multipole is not equal to zero, it acts inductively on the neighboring 
molecules, and the induced multipole interacts with the time-variable mul- 
tipole. This is thus interaction between a time-variable multipole and an 
induced multipole. 

So far, attractive forces have been considered; only electrostatic forces 
are repulsive for certain mutual orientations of the subsystems. If the inter- 
acting systems are not to collapse, there must be another force preventing 
the subsystems from approaching too closely. This force, termed exchange- 
repulsion, becomes important when the subsystems approach one another 
closely, to a distance at which the overlap of the electron spheres is non- 
zero. Interaction between two subsystems leads to an equilibrium between 
the attractive and repulsive forces, and a minimum appears on the curve 
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describing the dependence of the total interaction energy on the distance 
between the interacting systems. The minimum depth is related to the sta- 
bility of the given vdW molecule: complexes held together by dispersion 
forces have the lowest stabilization energy (1—4 kcal/mol, depending on the 
size of the molecule); vdW molecules containing hydrogen bonds are charac- 
terized by a stabilization energy of 2—8 kcal/mol per hydrogen bond; and, 
finally, ionic complexes, with a stabilization energy of up to 30 kcal/mol, 
are most stable. 

2 Calculation of interaction energy 

It was stated above that non-covalent, vdW interactions differ from cova- 
lent interactions. Here the question will be posed: can quantum chemistry 
describe non-covalent interactions as successfully as covalent interactions? 
The answer is unambiguously “Yes” . However, calculations in this field are 
one of the most difficult tasks facing quantum chemistry [3] . 

Just as in other regions of quantum mechanics and in quantum chem- 
istry as a whole, both methods for approximate solution of the Schrodinger 
equation (the variation and perturbation methods) are important in the 
study of non-covalent interactions. In the variation method, the interaction 
energy, AA, accompanying the formation of a supersystem by association 
of subsystems, is given by the difference between the energies of the su- 
persystem {E"^) and of the subsystems (the energy of the f-th subsystem 
is Ei) 



AE = E'^ - EE,. (2.1) 

This expression is very simple. However, the values of E"^ and EEi often 
differ by only tenth to thousandths of cal/mol and the value of E"^ for small 
and medium molecules equals 10® to 10^® kcal/mol; thus the calculation is 
numerically very difficult. A further difficulty lies in the fact that a method 
with central position in quantum chemistry, the SCF Hartree-Fock (HF) 
method, does not describe the correlation of the electron motion completely, 
and thus a certain portion of the total energy, called correlation energy, is 
neglected. Energies of atoms and molecules calculated by the HF method 
are subject to an error, termed the correlation error. The correlation en- 
ergy forms only a small portion (a few percent) of the total energy of the 
subsystems and of supersystem. However, participation of the correlation 
energy in the interaction energy is far more important. In some types of 
complexes {e.g. Ar. . . Ar), it is the only attractive force. Even in stronger 
complexes involving hydrogen bonds (water dimer, adenine. . .thymine), it 
forms an important part of the total interaction energy (10—30%). Thus, 
neglecting the correlation energy can lead to considerable errors. 




P. Hobza: Molecular Clusters 



571 



At first glance, the perturbation method appears far more suitable for 
calculation of non-covalent interactions. The interaction energy is calculated 
directly, rather than as the difference between two large, almost identical 
numbers. In this approach, AA is expressed as the sum of several con- 
tributions following from first and second order perturbation calculations, 
consisting primarily of the coulombic energy in which the electrostic 

multipole interaction forms the predominant attractive contribution, the in- 
duction energy {E^), the dispersion energy {E^) and the exchange-repulsion 
energy Despite of the problems involved in the variation method (see 

later), the vast majority of calculation of the interaction energy of various 
types of complexes is carried out using this method. This is because the vari- 
ation method is formally simple and straightforward, and because standard 
quantum chemical programs can be employed in this calculation. The per- 
turbational method is well suited for determining interactions between rigid 
systems, but its use is limited for systems for which many intramolecular 
and intermolecular degrees of freedom have to be optimized simultaneously. 
The perturbational approach, in the form of symmetry-adapted perturba- 
tion theory (SAPT) [4], has been used for highly accurate calculations of 
rigid systems, which provide a benchmark for supermolecular calculations. 

There are several advantages of the supermolecular approach which are 
profitable for studies of large clusters: i) it is applicable to any type of a 
molecular cluster; ii) it yields a wave function which can be used to derive 
various properties of the system; iii) highly accurate interaction energies can 
be obtained, provided that sufficiently large basis sets of atomic orbitals are 
used and a major portion of the of the correlation energy is covered; and 
iv) many-body interactions and charge transfer effects are explicitly taken 
into account. There are however, also drawbacks of this method and two 
inconsistencies should be mentioned: the size inconsistency and the basis 
set inconsistency. The size inconsistency does not represent any problem 
nowadays, since the most efficient methods for calculation of correlation 
energy are size-consistent (the HP method is size-consistent as well). The 
basis set inconsistency is a serious obstacle and leads to the basis set su- 
perposition error (BSSE). The BSSE is purely mathematical artifact due 
to the fact that different basis sets are used for energy evaluations of su- 
persystem and the subsystems. The supersystem possesses a larger basis 
set, which (artificially) increases its stabilization energy, and this effect is 
referred to as BSSE. Jansen and Ross [5], and Boys and Bernard! [6] inde- 
pendently introduced the function counterpoise (CP) method, which elimi- 
nates the BSSE. The principle of the method is simple: subsystems are not 
treated in their own basis sets but in the basis of the whole complex. The 
CP procedure is routinely used for evaluation of corrected stabilization en- 
ergy, which is, much less basis set dependent than uncorrected stabilization 
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energy. The use of CP procedure is, however, also important for determi- 
nation of the structure of a complex. Currently, complexes are optimized 
using the standard supermolecular gradient optimization and the BSSE cor- 
rection is applied only to a subsequent evaluation of interaction energy, i.e. 
after the optimization is finished. This is not correct, since the structure of 
a complex is optimized on the standard potential energy surface (PES) and 
not on the theoretically more justified CP-corrected PES. Optimization of 
the cluster structure with a small or medium basis set without consider- 
ation of BSSE can lead to a completely wrong structure. The problem of 
CP-corrected gradient optimization was solved recently [7,8] and the respec- 
tive method allows evaluation of the CP-corrected gradient and Hessians of 
a complex at any ah initio level. 

The correlation energy is much smaller than the HE energy; nevertheless 
the role of correlation energy is topical and cannot be neglected. The first 
treatment really applicable to large systems is the Mpller-Plesset (MP) 
perturbational theory [9] . The correlation energy is determined as a sum of 
the second, third, fourth, and higher contributions. The most economical 
and thus widely used is the simplest version of the MP method, the second- 
order MP theory (MP2). It is applicable to extended complexes (up to 
hundred of atoms) and gives surprisingly good estimates of the correlation 
energy. For accurate calculations the use of the coupled cluster theory [10] 
in the form of CCSD(T) method [11] is recommended. The method covered 
single and double excitations up to the infinite order, and the same is true 
for a part of the quadruple and hexatuple excitations; triple excitations are 
evaluated in a non-iterative way after performing a CCSD calculation. The 
method is iterative but nonvariational, size consistent, and represents the 
most robust tool for calculation of the correlation energy of large complexes. 

The choice of the atomic orbital (AO) basis set represents a compli- 
cated problem, and a basis set as large as possible should be used. The 
basis set applied should contain polarization and diffuse functions, and to 
obtain reliable stabilization energies, more sets of these functions are de- 
sirable. Among various basis sets, the correlation-consistent basis sets of 
Dunning [12], starting from the aug-cc-pvDZ (augmented correlation con- 
sistent polarized valence double-zeta), can be recommended. This basis 
set contains diffuse s-, p- and d-functions, which are essential for proper 
description of interaction energy. 

Use of lower-level theoretical methods (density functional theory, 
semiempirical quantum- chemical methods). The use of popular density func- 
tional theory (DFT) [13] in the realm of molecular clusters is very attrac- 
tive because the method is less computationally demanding than beyond 
HE methods. Because the DFT energy includes an exchange term and a 
contribution to the electron correlation energy, it was believed that it is 
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well suited for description of molecular clusters. The current DPT methods 
are known [14] to work well for H-bonded systems in terms of structure, 
dipole moments, energetics, and vibrational properties. Its general use for 
molecular clusters is very insecure because none of the existing functionals 
describe the London dispersion energy [14] . The DFT also fails for another 
important class of molecular complexes, namely charge transfer complexes, 
where these methods predict unrealistically deep energy minima. 

Use of currently available semiempirical quantum-chemical methods 
(such as AMI, PM3 or NDDO) for study of molecular clusters is also very 
limited since approximations used in these methods do not allow any rea- 
sonable description of interaction energy. 



3 Vibrational frequencies 

Formation of a molecular cluster gives rise to intermolecular vibrations. In- 
termolecular frequencies are usually much smaller than intramolecular ones 
and are typically around 100 cm“^, frequently even below 50 cm“^. Har- 
monic frequencies are easily determined even for large non-covalent clusters 
using Wilson FG analysis [15] and the respective procedure is available in 
various commercially distributed codes. Vibrational frequencies are, how- 
ever, not harmonic and “anharmonic” effects should be considered. How- 
ever, the term “anharmonic vibrations” is somewhat misleading since it 
implies the harmonic approximation as a starting point. Often, intermolec- 
ular vibrations in clusters involve hindered rotations and it makes no sense 
at all to use the language of harmonic oscillators. Similarly “anharmonic” 
corrections to the harmonic frequencies can often go into the wrong direc- 
tion. Given these cautions, here we keep the term “anharmonic” in the 
sense of non-harmonic. The PES obtained by high-level correlated ab initio 
calculations can be used for evaluating highly accurate anharmonic vibra- 
tional frequencies. The agreement between calculated frequencies and their 
experimental counterparts is usually much more satisfactory when anhar- 
monic effects are fully taken into account. This requirement is especially 
true for non-covalent clusters with a high degree of non-rigidity. 

The standard approach to the vibrational problem for large non-covalent 
clusters is perturbation theory [16, 17]. If the zero order Hamiltonian (usu- 
ally harmonic oscillator) is a good approximation to the true vibrational 
Hamiltonian, perturbation theory represents a very efficient and reliable 
tool for calculating vibrational frequencies. In the traditional approach, 
the matrix representation of the molecular Hamiltonian is diagonalized by 
means of successive contact transformation. This procedure will fail, how- 
ever, in the case of an accidental resonance. This is often the case if we deal 
with large systems with many vibrational modes. In this case the terms 
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connecting the resonant levels have to be treated variationally [17, 18]. Re- 
cently, considerable progress was achieved in developing new procedures 
based on perturbative treatment for the calculation of anharmonic frequen- 
cies [18-20]. The potential energy function is constructed as a low order 
polynomial (up to the fourth order) expressed in normal coordinates. The 
force constants are obtained by least-squares fitting of energies, gradients, 
and hessians calculated at geometries close to the global minimum on the 
PES. If it is not possible to work with extended basis sets (due to the 
cluster size) then energies, gradients and hessians should be obtained from 
the CP-corrected PES. The main advantage of that approach stems from 
its computational efficiency. The number of hessians required scales lin- 
early with the number of vibrational modes. Thus, the method can even 
be used for large systems while respecting the full dimensionality of the 
problem. The applicability of the procedure is, however, less straightfor- 
ward for non-covalent clusters since they are non-rigid systems. The vibra- 
tional dynamics of floppy systems cannot be described in the framework of 
a single-reference Hamiltonian and therefore, the perturbation series used 
are necessarily strongly divergent. In such a case the only alternative is a 
more exact treatment of the large amplitude vibrational modes including 
all relevant parts of the coordinate space. This requires calculation of the 
global PES that becomes computationally prohibitive even for systems with 
few degrees of freedom. However, the number of large amplitude motions is 
usually a small fraction of the total number of vibrations. Consequently, the 
large amplitude vibrations can be removed from the perturbative treatment 
and the Schrodinger equation for the effective large amplitude Hamiltonian 
is solved variationally. 

Literature on higher-dimensional anharmonic vibrational calculations 
of non-covalent clusters based on ab initio correlated calculations includes 
a variational six-dimensional intermolecular vibrational frequency calcula- 
tion for the adenine. . .thymine Watson-Crick base pair [21] and a twelve- 
dimensional vibrational frequency calculation for the water dimer by per- 
turbation theory [22]. Also a six-dimensional frequency calculation for the 
water dimer based on various empirical potentials was reported recently by 
LeForestier et al. [23] Other rigorous treatments of vibrations in clusters 
were reported. 



4 Potential energy surface 

Generally, the PES of molecular clusters is very rich and contains a large 
number of energy minima. The global minimum can generally be de- 
tected experimentally relatively easily, while elucidation of secondary min- 
ima (and saddle points) is much more difficult. These minima can be 
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theoretically ascribed only if the whole PES is known. The number of energy 
minima increases very rapidly with the cluster size and the number of sub- 
systems. While there is just one energy minimum on the water dimer 
PES [24-27] there are 11 minima on the uracil dimer PES [28], 284 min- 
ima on the benzene. .. Arg PES [29] and more than 1000 minima on the 
adenine. . .thymine. . . (H20)2 PES [30]. The localization of every minima 
is tedious, if not impossible, by standard methods based on experience and 
chemical intuition. It is necessary to use some effective search method for 
determining the geometry of the energy minima: these methods are based 
on computer experiments, mostly on molecular dynamic (MD) and Monte- 
Carlo (MC) simulations in combination with quenching technique. 

Generally, MD and MC simulations can be performed at any theoreti- 
cal level yielding energy and forces including nonempirical level. We have 
witnessed enormous progress over recent years in so called ab initio MD, 
specifically in first-principles MD by the Car-Parrinello method [31,32]. 
This approach represents very significant progress particularly because it 
is parameter-free, includes all many-body terms as well as charge-transfer 
and can be used even for formation and/or breaking of covalent bonds. 
The use of the Car-Parinello method for molecular clusters is, however, 
limited due to the fact that the plane-wave density functional method (as 
well as any other density functional method) does not cover the London 
dispersion energy (see later). The use of the method for charged or polar 
clusters where dispersion energy plays minor role is justified [33,34]. The 
vast majority of MD simulations are (and will be in the near future) based 
on an empirical potential. Such an approach is applicable for molecular 
clusters but cannot be applied for MD simulations of chemical reactions. 
The quality of MD simulations depends critically on the performance of the 
simulation technique but also on the quality of the empirical potential used. 
This fact is very frequently ignored and it is believed that long MD simu- 
lations always yield reliable results. However, the combination of high-level 
MD simulations with poor potential leads only to poor results. An empiri- 
cal potential should correctly describe all intermolecular and intramolecular 
degrees of freedom for any type of molecular cluster. These requirements 
are extremely strict and usually only a certain empirical potential is ap- 
plicable to a specific class of molecular clusters. We must bear in mind 
that more elaborated techniques of quantum chemistry {e.g. semiempirical 
or ab initio Hartree-Fock) also fail to simultaneously describe all types of 
molecular clusters {e.g. H-bonded and London dispersion clusters). Present 
empirical potentials also fail for charge-transfer complexes and in situations 
where many-body terms are important. Probably the only way to control 
the quality of an empirical potential is to compare its performance (stabi- 
lization energy, structure and geometry, vibrational frequencies) with that 
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of nonempirical correlated ab initio calculations. Comparison should be 
made for selected representatives of the studied class of molecular clusters. 

Amar and Berry [35] introduced a very efficient quenching technique for 
the evaluation of cluster structures; the method was originally developed 
by Stillinger and Weber [36] for liquids. The principle of the technique 
is as follow: in the course of MD simulation, after an arbitrary number 
of steps, the simulation is stopped and energy minimization (quench) is 
performed. The minimal energy and coordinates of the minimum found are 
stored and then the MD starts again from the point where it was stopped. 
After a sufficiently long simulation, a list of nearly all stationary points 
of the cluster is obtained. The completeness of the list could be verified 
by starting the quenching at a different temperature and by changing the 
number of steps. 



5 Free energy surface 

Populations of various structures of a cluster which are proportional to the 
change of the Gibbs free energy can be determined by computer simulations 
for NVT canonical and NVE micro-canonical ensembles (N, V, E and T refer 
to the number of molecules in a system, its volume, energy and tempera- 
ture) combined with the quenching method [35,36]. In the NVT canonical 
ensemble, the cluster is in temperature equilibrium with the surroundings, 
and accordingly, the NVT ensemble gives information of the behavior of 
the cluster when it is interacting with the surrounding. In the NVE micro- 
canonical ensemble all the systems have the same energy and each system 
is individually isolated. Clearly, the NVE and NVT ensemble results must 
coincide in the limit of infinitely large systems [37,38]. However, small 
molecular clusters are all but infinite size systems. This has important con- 
sequences for the probability P{E) = D,{E) exp{—E/kT) (where D,{E) is 
the density of states with energy E) of finding the NVT ensemble at en- 
ergy E. Whereas small systems approach the one degree of freedom limit 
P{E) = exp{—E/kT), large systems with high densities of states possess 
canonical probabilities which approach a J-function peaked at the energy of 
a corresponding microcanonical ensemble [38] . 

The calculation of populations from quenching is possible in a rather 
narrow energy interval. The energy should be sufficiently high to allow a 
high frequency of interconversions among different isomers. The basic ad- 
vantage of this technique is the fact that it enables an evaluation of relative 
populations for all structures on the PES, i.e. to pass from the PES to the 
free energy surface (FES). Such calculations require long simulation runs 
and are thus feasible only for relatively small clusters. Using the free en- 
ergy perturbation/MD procedure we obtain relative AG values for much 
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larger clusters but only for small sections of the PES. These calculations 
need significantly less simulation time. 

The comparison of theoretical and experimental results require some 
comments. First, the temperature T of the cluster formation in a particular 
experiment should be known. The stabilization enthalpy at 0 K and the PES 
characteristics can be only used if T is zero or very low. In many cases, 
however, temperature is higher and entropy term should be considered. 
This means that the FES characteristics and not the PES characteristics 
should be utilized for comparison with experimental results. Care should 
be further paid to the type of simulations, NVE or NVT. Simulations in 
the NVE ensemble give properties of a cluster that does not interact with 
the surroundings while simulations in the NVT ensemble correspond to a 
situation when the cluster is in thermal equilibrium with the surroundings. 

6 Applications 

6.1 Benzene. .. Arn clusters 

Benzene . . . Ar 2 [29] . Two structures of the titled dimer were obtained by the 
gradient optimization of the potential (obtained by fitting to the ah initio 
MP2 PES of the benzene. . . Ar cluster) in the form of equation (5). 

U = -B{1- C/R) /R^. (6.1) 

The global minimum {AE = —2.25 kcal/mol) corresponds to the struc- 
ture having both argons on the Ce axis and localized on the opposite sides 
of the benzene molecule (structure (Ijl). The local minimum (AE = 
—2.03 kcal/mol) corresponds to the structure having both argons on one 
side of the benzene molecule; abbreviation (2|0) is used for this structure. 
The MD simulations shows that less stable local minimum is more populated 
than the global minimum due to an entropy term. 

Benzene. .. and Benzene. .. clusters [29]. The global minimum 
and the first local minimum of the former complex (both having all three 
argons on one side of the benzene molecule - abbreviation (3|0)) were pop- 
ulated less than the second local minimum having a two-sided structure 
(2|1). The relative populations of these stationary points are 40, 13, and 
46%. The first local two-sided minimum of the latter complex with the (4|1) 
structure is populated more than that of the global one-sided one having 
(5|0) structure. Similarly as in the previous cluster three lowest-energy min- 
ima are populated much more than the remaining stationary points. For 
both clusters we obtained a similar picture - the two-sided local minima 
were populated more than one-sided global minima. 

Benzene. .. Ars [29]. The quenching technique localized 156 stationary 
points for the cluster at 25.9 K. The clearly dominant population 
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corresponds to minimum (3|2|3) having a bridgelike (pocket) structure. This 
local minimum is 0.17 kcal/mol less stable than the global minimum with 
the symmetrical (7|1) structure. Seven argons on one side constitutes the 
full one-side solvation, forming a “flowerlike” structure which is energetically 
very favorable. Evidently, the global minimum (7|1) could have only two 
realizations while the bridgelike structure (3|2|3) could have considerably 
more. According to the Boltzman relation among entropy (S'), Boltzman’s 
constant (kh), and number W, which is the number of ways in which the 
systems in the sample can be arranged still having the same total energy, 

S = fcb In kb (6.2) 

the entropy of the (3|2|3) structure is expected to be higher than that of 
the (7|1) structure. (Let us add that the same argument is valid also for 
lower benzene. . . Ar„ clusters with inclusion of the benzene. . . Ar 2 one.) 

No one-sided structure of the complex was observed during simula- 
tions. This conclusion agrees with the experimental data in [39]: “in the 
benzene. . . Arg spectrum, no feature is observed in the frequency range cor- 
responding to the one-sided isomers” . 

Concluding remarks. With the exception of the very low temperatures, 
the global minimum of the benzene. . . Ar„ clusters was not most polulated. 
The low populations of the global minima having one-sided structures and 
high populations of two-sided and bridged structures are explained by the 
entropy term. The PES and FES of titled clusters differ considerably and 
benzene. . . Arg represents a textbook example where the global minimum 
is populated 5-8 times less than the less stable minimum with a bridgelike 
structure. 

6.2 Aromatic system dimers and oligomers 

The benzene-benzene interaction is of key importance as a prototype of 
interaction prevailing in the aromatic 7r-systems [40]. Its understanding 
is crucially important for the interpretation of diverse phenomena such as 
base pair stacking in DNA, intercalation of drugs into DNA, crystal packing 
of aromatic molecules, formation of tertiary structure of proteins or por- 
phyrin aggregation. Despite extensive experimental [41-51] and theoretical 
efforts [52-60] our knowledge of the structural and dynamical properties of 
the benzene dimer is still limited. This concerns even the structure of the 
global minimum and the number of other stationary points on the potential 
energy surface. 

The first step toward understanding the dimer structure was achieved by 
Arunan and Gutowsky [50] who reported well resolved rotational spectra of 
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the dimer. With the pulsed nozzle Fourier transform microwave spectrome- 
ter, the authors obtained rotational constants from which the center of mass 
separation was estimated. The evaluated distance was consistent with the 
T-shaped dimer structure. 

A major step forward has recently come from the investigation of the 
structure and vibrational dynamics of the benzene dimer using the non- 
empirical model (NEMO) empirical potential parameterized using CCSD(T) 
calculations [61]. In this study, gradient and hessian calculations using 
the NEMO potential revealed the existence of just one energy minimum 
corresponding to the T-shaped structure. The parallel-displaced structure 
which has been believed to be an energy minimum (and sometimes even 
the global minimum) was clearly identified as a transition state structure 
separating two equivalent T-shaped dimer structures. 

Due to a relatively high transition barrier (~170 cm“^), interconversion 
tunneling is unimportant in the energy region spanned by the available 
rotational spectra [50] and is thus neglected. The dimer undergoes a nearly 
free internal rotation along the axis connecting the benzene centers of mass 
in the T-shaped equilibrium geometry and a hindered internal rotation (the 
barrier being ~46 cm“^) along the axis that is perpendicular to the “nearly 
free” internal rotation axis. 

The tunneling splitting observed in the rotational spectrum [50] is prob- 
ably due to this hindered rotation. An analysis assuming the latter rotation 
as an independent motion and using purely vibrational tunneling splitting 
indicates that the genuine value of the hindered rotation barrier is nearly 
twice higher than its ab initio value. Similarly, the difference AR = 0.25 A 
between the equilibrium ground state value for the distance of the mass 
centers of the benzene monomers from ab initio calculation and experiment 
is strong evidence that the theoretical potential is much shallower than the 
correct one. In a complimentary stimulated Raman study, some bands were 
observed around 3 to 10 cm“^ region [49]. These bands can now be assigned 
to the nearly free rotation and the “energy minimum path” bending motion. 

Benzene trimers and tetramers were studied [62] using the same empir- 
ical potential used for the benzene dimer. The important advantage of the 
NEMO potential is that it includes induction energy, allowing the recovery 
of the many-body energy terms. The induction three-body term is, how- 
ever, only one of many three-body terms. The others, involving exchange 
and dispersion might be larger but their evaluation (especially dispersion) 
is rather difficult. For the benzene trimer, three energy minima were found. 
The most stable cyclic trimer structure explains the experimentally observed 
spectral shifts and binding energies [63] . The calculated harmonic frequen- 
cies conform to the experimentally found intermolecular vibrations [64]. 
For the tetramer five structures were obtained. The experimentally found 
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conformer [65, 66] does not represent the cyclic lowest-energy structure but 
rather a cyclic trimer with a monomer attached to one side. This can be 
understood from entropy considerations. These calculations show that a 
carefully calibrated empirical potential is well suited to describe larger ben- 
zene clusters that are out of reach for accurate ab initio quantum chemical 
calculations. 

Many-body terms were shown to be important in the case of water clus- 
ters. When the polarity of the subsystems is decreasing, many-body terms 
are expected to be less important. We have, however, shown [62] that for 
the cyclic benzene trimer the three-body term contributes about 5% to the 
overall stabilization. Moreover, the three-body contribution can be attrac- 
tive as well as repulsive. Three-body terms in benzene oligomers are thus 
not negligible. Theoretical analysis has further shown [62] that the four- 
body terms can be omitted. The importance of three body interactions was 
confirmed in a study of the naphthalene trimer using ab initio calculations 
and an MM3 force field [67]. The lowest energy structure was found to be 
the Qzh cyclic structure which is consistent with experimental results [68] . 

6.3 Nucleic acid-base pairs 

As is well known, genetic information is stored and duplicated in DNA, 
which can thus be considered one of the most important molecules in our 
life. The functionality of DNA is predetermined by its unique 3-dimensional, 
right-handed, double helical structure. The structure of DNA is influenced 
by various contributions, among them the H-bonding and stacking interac- 
tions of nucleic acid (NA) bases play an important role. 

The experimental characterization of base pairs is extremely difficult and 
only very few reliable studies are available. There is still only one published 
gas phase experiment on the energetics of H-bonded NA base pairs [69] and 
gas phase experimental data on base stacking are absent; the same is true for 
the structure of NA base pairs. Presently high level quantum chemical cal- 
culations and computer simulations represent the only tools to obtain some 
reference data on structure, energetics, vibrational frequencies and other 
properties of NA base pairs. These data are essential for understanding the 
function and properties of NA base pairs and are also very important for 
the verification and/or parameterization of empirical potentials for molec- 
ular modeling of bio-macromolecules and their interactions. 

The competition between base stacking and H-bonding of NA bases was 
first investigated by ab initio methods about ten years ago. More recently, 
ab initio calculations on these clusters including correlation energy have 
become feasible and have produced more reliable results [70,71]. These 
calculations now essentially provide a conclusive picture of the interaction 
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of NA bases in the gas phase and can be summarized as follows: 

(i) H-bonded pairs stabilized by electrostatic interactions are more sta- 
ble than the stacked structures, which are stabilized by dispersion 
interactions; 

(ii) for a consistent description of H-bonding and stacking interactions, 
the theoretical procedure must include the London dispersion energy, 
ruling out the use of Hartree-Fock ab initio and density functional 
methods. Also semi-empirical methods of quantum chemistry cannot 
be used; 

(iii) intermolecular vibrational frequencies of H-bonded base pairs are al- 
most constant and do not depend on stabilization energy. For all NA 
base pairs, the buckle and propeller twist vibrations are the lowest 
and they are all in the region of 4—30 cm“^ in the harmonic approx- 
imation; 

(iv) in-plane intermolecular harmonic vibrations of the adenine- • • thymine 
NA base pair are systematically overestimated with respect to anhar- 
monic vibrations by about 50% while the absolute differences among 
anharmonic and harmonic intermolecular out-of-plane vibrations is 
lower than 15%. The 50% error in vibrational frequencies is large 
and care should be paid to the use of harmonic intermolecular vibra- 
tional frequencies of NA base pairs. All anharmonic intermolecular 
frequencies are, however, systematically overestimated in comparison 
with harmonic ones. This provides an opportunity to use the standard 
harmonic approach to obtain at least an approximate description of 
the lowest vibrational frequencies of NA base pairs. It must be remem- 
bered that anharmonic treatment of such large clusters is extremely 
tedious; 

(v) the stability order of hydrogen-bonded NA base pairs is not changed 
when passing from the AA to the AG description. Though the dif- 
ferent NA base pairs vary greatly in stabilization energy, the entropy 
term is always nearly constant; 

(vi) the entropy contribution differs considerably for hydrogen-bonded and 
stacked structures. This implies that energetically less favorable 
stacked structures can be favored compared to the H-bonded ones 
when passing from the AA to the AG description. To obtain the 
free energy and hence a reliable thermodynamic characterisation for 
H-bonded and stacked base pairs, computer experiment simulations 
are required, mainly molecular dynamics simulations; 
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(vii) the order of stability of various dimer structures of NA base pairs is 
different for the PES and FES. This indicates that experimental data 
should be compared with the FES and not with the PES. However, 
this would only be correct for experiments carried out at temperatures 
much higher than 0 K. The necessity to use the FES in contrast to the 
PES will be particularly important for those NA base pairs for which 
the difference in stabilization energy between H-bonded and stacked 
structures is small and hence the entropy contribution becomes most 
significant. In these cases, even the structure of the global minimum 
may be different for both surfaces; 

(viii) among various empirical potentials tested, the Amber potential with 
the Cornell et al. force field [72] was found to best reproduce the 
ah initio H-bonding and stacking stabilization energies and is at 
present recommended for computer simulations of DNA and RNA. On 
the basis of the analysis of ab initio and empirical potential data, it is 
evident that future force field generations should include an anisotropy 
and a polarisation term. The polarisation term would allow the inclu- 
sion of many-body effects; 

(ix) it is surprising that such a good agreement between theoretical and ex- 
perimental values concerning structure and function of DNA oligomers 
is found for standard MD simulations since only pair potentials are 
applied. This agreement is even more surprising when the highly ionic 
and polar character of DNA and surrounding water is considered. It 
is thus quite possible that the agreement obtained is fortuitous and 
is (as frequently in the field of non-covalent interactions) due to com- 
pensation of errors. 
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